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METHODS FOR IDENTIFYING RISK OF OSTEOARTHRITIS AND 
TREATMENTS THEREOF 



f^lH nf the Invention 

fmi The invention rela.es ,o g e„e„c methods for identifying risk of osteons and treatments 
that specifically target such diseases. 
Background 

,0002, Osteoarthritis (OA) is a chrome disease usually affecting weight-bearing synovu.1 J** 
There arc approximately 20 million Americans affected by OA and it is the leading cause of disability in 
me United States, ha addition to extensive human suffering, OA also accounts for nearly all kmee 
replacements and more man half of .1, hip replacements ,„ the United States Despite its prevalence OA 
is p„„r,y understood and there are few treatments available besides anti-innammatory drugs and join, 

"""Most common,, affecting m,dd,e-aged and older people, OA can range fron , very mild to 
very severe. 1. affects bands and weight-bearingjoints such as knees, hips, feet and the back. Knee OA 
can be as disabling as any cardiovascular disease except stroke. 

,0004, OA is characterized by the breakdown of cartilage in joints. Cartilage ,n joints cush ons the 
ends of bones, and cartilage breakdown causes bones to rob against each other, causing P am and loss of 

approximately half the dry weigh,, and representing 90-95 % of the tota, collagen content ~ » 
forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthntis Rhuem. 32.24,-246 (1989)). 

Summary 

,0005] I. has been discovered that certain polymorphic variations m human genomic DN A are 
associated with osteoarthritis. In particular, polymorphic variants in a locus containing a ADAMTS2 
region in human genomic DNA have been associated with nsk of osteoarthritis 

,0006! Thus, featured herein are methods for identifying , subject a, risk of osteoarthritis and/or 
risk of osteoarthritis in a subject, which comprise deteefing the presence or absence of one or more 
polymorphic variations associated with osteoarthritis in or around the leci described herein ,n a human 
ILL sample, h, an embodiment, Pwo or more pol^hie vanafions are ™ 
regions of which one is the ADAMTS2 region. In certain embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 
1 1, 12, 13, 14, 15, lb, 17, 18, 19 or 20 or more polymorphic variants are detected. 
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|00071 Also featured are nucleic acids that include one or more polymorphic variations associated 
with occurrence of osteoarthritis, as well as polypeptides encoded by these nucleic acids. In addition, 
provided are methods for identifying candidate therapeutic molecules for treating osteoarthritis, as well 
as methods for treating osteoarthritis in a subject by identifying a subject at risk of osteoarthritis and 
treating the subject with a suitable prophylactic, treatment or therapeutic molecule. 

|0008] Also provided are compositions comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and/or a ADAMTS2 nucleic acid, with a RNAi, siRNA, antisense DNA or RNA, or 
ribozyme nucleic acid designed from a ADAMTS2 nucleotide sequence. In an embodiment, the RNAi, 
siRNA, antisense DNA or RNA, or ribozyme nucleic acid is designed from a ADAMTS2 nucleotide 
sequence that includes one or more polymorphic variations associated with osteoarthritis, and in some 
instances, specifically interacts with such a nucleotide sequence. Further, provided are arrays of nucleic 
acids bound to a solid surface, in which one or more nucleic acid molecules of the array have a 
ADAMTS2 nucleotide sequence, or a fragment or substantially identical nucleic acid thereof, or a 
complementary nucleic acid of the foregoing. Featured also are compositions comprising a cell from a 
subject having osteoarthritis or at risk of osteoarthritis and/or a ADAMTS2 polypeptide, with an antibody 
that specifically binds to the polypeptide. Thus, featured is an antibody that specifically binds to an 
epitope in the polypeptide that includes an amino acid encoded by a polymorphic site associated with 
osteoarthritis. In certain embodiments, the antibody specifically binds to an epitope comprising a valine 
or isoleucine encoded by rs398829 (e.g., an antibody that binds to an epitope comprising a valine at 
position 245 in an ADAMTS2 polypeptide) A valine at position 245 is associated with increased risk of 
osteoarthitis. 

Brief Description of the Drawings 

[0009] Figure 1 shows proximal SNPs in a ADAMTS2 region in genomic DNA. The position of 
each SNP in the chromosome is shown on the x-axis and the y-axis provides the negative logarithm of 
the p-value comparing the estimated allele to that of the control group. Also shown in the figure are 
exons and introns of the region in the approximate chromosomal positions. 

Detailed Description 

fOOlO] It has been discovered that a polymorphic variant in a locus containing a ADAMTS2 region is 
associated with occurrence of osteoarthritis in subjects. Thus, detecting genetic determinants associated 
with an increased risk of osteoarthritis occurrence can lead to early identification of a predisposition to 
osteoarthritis and early prescription of preventative measures. Also, associating a ADAMTS2 
polymorphic variant with osteoarthritis has provided new targets for screening molecules useful in 
treatments of osteoarthritis. 
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Osteoarthritis and Sample Selection 
[0011] Osteoarthritis (OA), or degenerative joint disease, is one of the oldest and most common 
types of arthritis. It is characterized by the breakdown of the joint's cartilage. Cartilage is the part of the 
joint that cushions the ends of bones, and its breakdown causes bones to rub against each other, causing 
pain and loss of movement. Type II collagen is the main component of cartilage, comprising 15-25% of 
the wet weight, approximately half the dry weight, and representing 90-95% of the total collagen content 
in the tissue. It forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthritis Rhuem. 
32:241-246(1989)). 

[0012] Most commonly affecting middle-aged and older people, OA can range from very mild to 
very severe. It affects hands and weight-bearing joints such as knees, hips, feet and the back. Knee OA 
can be as disabling as any cardiovascular disease except stroke. Whereas Ehlers-Danlos syndrome type 
VIIC is characterized by the retention of the N-terminal propeptide of type I collagen, osteoarthritis has 
been characterized by increased levels of type II collagen in osteoarthritic cartilage as measured by 
elevated C-propeptide concentrations (Nelson et al. (1998) J. Clin. Invest. 102(12):21 15-2125). 

[0013] Osteoarthritis affects an estimated 20.7 million Americans, mostly after age 45, with women 
more commonly affected than men. Physicians make a diagnosis of OA based on a physical exam and 
history of symptoms. X-rays are used to confirm diagnosis. Most people over 60 reflect the disease on 
X-ray, and about one-third have actual symptoms. 

[0014] There are many factors that can cause OA. Obesity may lead to osteoarthritis of the knees. 
In addition, people with joint injuries due to sports, work-related activity or accidents may be at increased 
risk of developing OA. 

[0015] Genetics has a role in the development of OA. Some people may be born with defective 
cartilage or with slight defects in the way that joints fit together. As a person ages, these defects may 
cause early cartilage breakdown in the joint or the inability to repair damaged or deteriorated cartilage in 
the joint. 

[0016] Inclusion or exclusion of samples for an osteoarthritis pool may be based upon the following 
criteria: ethnicity (e.g., samples derived from an individual characterized as Caucasian); parental 
ethnicity (e.g., samples derived from an individual of British paternal and maternal descent); relevant 
phenotype information for the individual (e.g., case samples derived from individuals diagnosed with 
specific knee osteoarthritis (OA) and were recruited from an OA knee replacement clinic). Control 
samples may be selected based on relevant phenotype information for the individual (e.g., derived from 
individuals free of OA at several sites (knee, hand, hip etc)); and no family history of OA and/or 
rheumatoid arthritis. Additional phenotype information collected for both cases and controls may include 
age of the individual, gender, family history of OA, diagnosis with osteoarthritis (joint location of OA, 
date of primary diagnosis, age of individual as of primary diagnosis), knee history (current symptoms, 
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any major knee injury, menisectomy, knee replacement surgery, age of surgery), HRT history, 
osteoporosis diagnosis. 

|0017] Based in part upon selection criteria set forth above, individuals having osteoarthritis can be 
selected for genetic studies. Also, individuals having no history of osteoarthritis often are selected for 
genetic studies, as described hereafter. 

Polymorphic Variants Associated with Osteoarthritis 

[0018] A genetic analysis provided herein linked osteoarthritis with polymorphic variant nucleic 
acid sequences in the human genome. As used herein, the term "polymorphic site" refers to a region in a 
nucleic acid at which two or more alternative nucleotide sequences are observed in a significant number 
of nucleic acid samples from a population of individuals. A polymorphic site may be a nucleotide 
sequence of two or more nucleotides, an inserted nucleotide or nucleotide sequence, a deleted nucleotide 
or nucleotide sequence, or a microsatellite, for example. A polymorphic site that is two or more 
nucleotides in length may be 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more, 20 or more, 30 or more, 50 
or more, 75 or more, 100 or more, 500 or more, or about 1000 nucleotides in length, where all or some of 
the nucleotide sequences differ within the region. A polymorphic site is often one nucleotide in length, 
which is referred to herein as a "single nucleotide polymorphism" or a "SNP." 

[0019] Where there are two, three, or four alternative nucleotide sequences at a polymorphic site, 
each nucleotide sequence is referred to as a "polymorphic variant" or "nucleic acid variant." Where two 
polymorphic variants exist, for example, the polymorphic variant represented in a minority of samples 
from a population is sometimes referred to as a "minor allele" and the polymorphic variant that is more 
prevalently represented is sometimes referred to as a "major allele." Many organisms possess a copy of 
each chromosome (e.g., humans), and those individuals who possess two major alleles or two minor 
alleles are often referred to as being "homozygous" with respect to the polymorphism, and those 
individuals who possess one major allele and one minor allele are normally referred to as being 
"heterozygous" with respect to the polymorphism. Individuals who are homozygous with respect to one 
allele are sometimes predisposed to a different phenorype as compared to individuals who are 
heterozygous or homozygous with respect to another allele. 

[0020] In genetic analysis that associate polymorphic variants with osteoarthritis, samples from 
individuals having osteoarthritis and individuals not having osteoarthritis often are allelotyped and/or 
genotyped. The term "allelotype" as used herein refers to a process for determining the allele frequency 
for a polymorphic variant in pooled DNA samples from cases and controls. By pooling DNA from each 
group, an allele frequency for each SNP in each group is calculated. These allele frequencies are then 
compared to one another. The term "genotyped" as used herein refers to a process for determining a 
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genotype of one or more individuals, where a "genotype" is a representation of one or more polymorphic 
variants in a population. 

[0021 J A genotype or polymorphic variant may be expressed in terms of a "haplotype," which as 
used herein refers to two or more polymorphic variants occurring within genomic DNA in a group of 
individuals within a population. For example, two SNPs may exist within a gene where each SNP 
position includes a cytosine variation and an adenine variation. Certain individuals in a population may 
carry one allele (heterozygous) or two alleles (homozygous) having the gene with a cytosine at each SNP 
position. As the two cytosines corresponding to each SNP in the gene travel together on one or both 
alleles in these individuals, the individuals can be characterized as having a cytosine/cytosine haplotype 
with respect to the two SNPs in the gene. 

[0022] As used herein, the term "phenotype" refers to a trait which can be compared between 
individuals, such as presence or absence of a condition, a visually observable difference m appearance 
between individuals, metabolic variations, physiological variations, variations in the function of 
biological molecules, and the like. An example of a phenotype is occurrence of osteoarthritis. 

[0023] Researchers sometimes report a polymorphic variant in a database without determining 
whether the variant is represented in a significant fraction of a population. Because a subset of these 
reported polymorphic variants are not represented in a statistically significant portion of the population, 
some of them are sequencing errors and/or not biologically relevant. Thus, it is often not known whether 
a reported polymorphic variant is statistically significant or biologically relevant until the presence of the 
variant is detected in a population of individuals and the frequency of the variant is determined. Methods 
for detecting a polymorphic variant in a population are described herein, specifically in Example 2. A 
polymorphic variant is statistically significant and often biologically relevant if it is represented in 5% or 
more of a population, sometimes 10% or more, 15% or more, or 20% or more of a population, and often 
25% or more, 30% or more, 35% or more, 40% or more, 45% or more, or 50% or more of a population. 

[0024] A polymorphic variant may be detected on either or both strands of a double-stranded nucleic 
acid. Also, a polymorphic variant may be located within an intron or exon of a gene or within a portion 
of a regulatory region such as a promoter, a 5' untranslated region (UTR), a 3' UTR, and in DNA (e.g., 
genomic DNA (gDNA) and complementary DNA (cDNA)), RNA (e.g., mRNA, tRNA, and rRNA), or a 
polypeptide. Polymorphic variations may or may not result in detectable differences in gene expression, 
polypeptide structure, or polypeptide function. 

[0025] It was determined that polymorphic variations associated with an increased risk of 
osteoarthritis existed in a ADAMTS2 region in SEQ ID NO: 1 . in certain embodiments, a polymorphic 
variant at position rs398829 in the human genome was associated with an increased risk of osteoarthritis, 
and in a specific embodiment, a guanine at position rs398829 was associated with an increased risk of 
osteoarthritis. 
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[0026] Polymorphic variants in and around the ADAMTS2 locus were tested for association with 
osteoarthritis. These include polymorphic variants at positions in SEQ ID NO: 1 selected from the group 
consisting of 210, 3608, 3609, 4318, 5593, 5629, 5639, 5640, 8943, 17968, 19887, 21034, 21085, , 
21596, 23379, 23432, 24007, 26121, 26273, 26755, 27411, 27710, 27842, 28379, 29603, 31232, 31504, 
32583, 32794, 32840, 33044, 33150, 33218, 33513, 33959, 34486, 36289, 36570, 38247, 38477, 38518, 
38529, 38667, 39781, 39856, 39927, 40506, 41869, 42452, 44788, 46059, 46846, 47712, 48796, 49441, 
49602, 49723, 50050, 50171, 50477, 50818, 50833, 50881, 50882, 51386, 51534, 52317, 52368, 52970, 
53023, 53356, 53882, 54553, 55475, 55530, 55691, 55848, 55879, 56316, 56911, 57320, 57391, 57437, 
57478, 57500, 59111, 59333, 59715, 59804, 59851, 59929, 60052, 60240, 60359, 60381, 60456, 60724, 
60875, 60968, 60978, 60998, 61557, 62091, 62645, 62943, 63131, 63145, 63406, 63427, 63554, 63661, 
64093, 64153, 64409, 64544, 65257, 65626,65739 , 66392, 66720, 69177, 69336, 69636, 69823, 69928, 
70547, 70633, 71805, 72181, 72200, 72474, 72567, 72973, 73468, 73889, 75730, 75970, 76114, 76342, 
76449, 76465, 76791, 78042, 80758, 80778, 81356, 81576, 81689, 81759, 81950, 82562, 83591, 83700, 
83821, 83842, 83923, 83929, 84021, 84175, 84417, 84747, 85746, 86129, 86335, 87315, 87648, 87764, 
87770, 88221, 90474, 91148, 91150, 91160, 91733, 91772, 91785, 93140, 93148, 96080, 96157, 96313, 
96759, 97026, 97320, 97732, 98713, 99707, 99959, 100009, 100020, 100065, 100086, 101270, 101276, 
101371, 101376, 101439, 101820, 102392, 102602, 102604, 102896, 189104, 189134 and 189205. 
Polymorphic variants at the following positions in SEQ ID NO: 1 in particular were associated with an 
increased risk of osteoarthritis: 5640, 33150, 38247, 38529, 46846, 49723, 50050, 63427, 73889, 
189104 and rs428901, where specific embodiments are directed to positions 46846, 73889, 189104 
and/or rs428901. In particular, the following polymorphic variants in SEQ ID NO: 1 were associated 
with risk of osteoarthritis: a cytosine at position 5640, a cytosine at position 33150, an adenine at 
position 38247, a thymine at position 38529, an adenine at position 46846, a cytosine at position 49723, a 
cytosine at position 50050, a cytosine a position 63427, a guanine at position 73889, a thymine at 
position 189104, and an adenine at position rs428901. 

[0027] Based in part upon analyses summarized in Figure 1, a region with significant association 
has been identified in a locus associated with osteoarthritis. Any polymorphic variants associated with 
osteoarthritis in a region of significant association can be utilized for embodiments described herein. For 
example, polymorphic variants in a region spanning chromosome positions 178746000 to 178751000 
(approximately 5,000 nucleotides in length) in a ADAMTS2 locus have significant association 
(chromosome positions are within NCBI's Genome build 34). 

Additional Polymorphic Variants Associated with Osteoarthritis 
[0028] Also provided is a method for identifying polymorphic variants proximal to an incident, 
founder polymorphic variant associated with osteoarthritis. Thus, featured herein are methods for 
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identifying a polymorphic variation associated with osteoarthritis that is proximal to an incident 
polymorphic variation associated with osteoarthritis, which comprises identifying a polymorphic variant 
proximal to the incident polymorphic variant associated with osteoarthritis, where the incident 
polymorphic variant is in a ADAMTS2 nucleotide sequence. The nucleotide sequence often comprises a 
polynucleotide sequence selected from the group consisting of (a) a polynucleotide sequence of SEQ ED 
NO: 1-3; (b) a polynucleotide sequence that encodes a polypeptide having an amino acid sequence 
encoded by a polynucleotide sequence of SEQ ID NO: 1-3; and (c) a polynucleotide sequence that 
encodes a polypeptide having an amino acid sequence that is 90% or more identical to an amino acid 
sequence encoded by a nucleotide sequence of SEQ DD NO: 1-3 or a polynucleotide sequence 90% or 
more identical to the polynucleotide sequence of SEQ ED NO: 1-3. The presence or absence of an 
association of the proximal polymorphic variant with osteoarthritis then is determined using a known 
association method, such as a method described in the Examples hereafter. In an embodiment, the 
incident polymorphic variant is a polymorphic variant associated with osteoarthritis described herein. In 
another embodiment, the proximal polymorphic variant identified sometimes is a publicly disclosed 
polymorphic variant, which for example, sometimes is published in a publicly available database. In 
other embodiments, the polymorphic variant identified is not publicly disclosed and is discovered using a 
known method, including, but not limited to, sequencing a region surrounding the incident polymorphic 
variant in a group of nucleic samples. Thus, multiple polymorphic variants proximal to an incident 
polymorphic variant are associated with osteoarthritis using this method. 

[0029] The proximal polymorphic variant often is identified in a region surrounding the incident 
polymorphic variant. In certain embodiments, this surrounding region is about 50 kb flanking the first 
polymorphic variant (e.g. about 50 kb 5' of the first polymorphic variant and about 50 kb 3' of the first 
polymorphic variant), and the region sometimes is composed of shorter flanking sequences, such as 
flanking sequences of about 40 kb, about 30 kb, about 25 kb, about 20 kb, about 15 kb, about 10 kb, 
about 7 kb, about 5 kb, or about 2 kb 5 ' and 3' of the incident polymorphic variant. In other 
embodiments, the region is composed of longer flanking sequences, such as flanking sequences of about 
55 kb, about 60 kb, about 65 kb, about 70 kb, about 75 kb, about 80 kb, about 85 kb, about 90 kb, about 
95 kb, or about 100 kb 5' and 3' of the incident polymorphic variant. 

[0030] In certain embodiments, polymorphic variants associated with osteoarthritis are identified 
iteratively. For example, a first proximal polymorphic variant is associated with osteoarthritis using the 
methods described above and then another polymorphic variant proximal to the first proximal 
polymorphic variant is identified {e.g., publicly disclosed or discovered) and the presence or absence of 
an association of one or more other polymorphic variants proximal to the first proximal polymorphic 
variant with osteoarthritis is determined. 
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[0031] The methods described herein are useful for identifying or discovering additional 
polymorphic variants that may be used to further characterize a gene, region or loci associated with a 
condition, a disease (e.g., osteoarthritis), or a disorder. For example, allelotyping or genotyping data 
from the additional polymorphic variants may be used to identify a functional mutation or a region of 
linkage disequilibrium. In certain embodiments, polymorphic variants identified or discovered within a 
region comprising the first polymorphic variant associated with osteoarthritis are genotyped using the 
genetic methods and sample selection techniques described herein, and it can be determined whether 
those polymorphic variants are in linkage disequilibrium with the first polymorphic variant. The size of 
the region in linkage disequilibrium with the first polymorphic variant also can be assessed using these 
genotyping methods. Thus, provided herein are methods for determining whether a polymorphic variant 
is in linkage disequilibrium with a first polymorphic variant associated with osteoarthritis, and such 
information can be used in prognosis/diagnosis methods described herein. 

Isolated Nucleic Acids 

[0032] Featured herein are isolated ADAMTS2 nucleic acid variants depicted in SEQ ID NO: 1-3, 
and substantially identical nucleic acids thereof. A nucleic acid variant may be represented on one or 
both strands in a double-stranded nucleic acid or on one chromosomal complement (heterozygous) or 
both chromosomal complements (homozygous). ADAMTS2 exists in two forms, a "long" form 
comprising a molecule approximately 130 kDa in length (e.g., SEQ ID NO: 2 for cDNA sequence and 
SEQ ID NO: 4 for amino acid sequence), and a "short" form comprising a molecule approximately 70 
kDa in length (e.g., SEQ ID NO: 3 for cDNA sequence and SEQ ID NO: 5 for amino acid sequence). 
Provided herein are polynucleotide sequences encoding both the short and long forms of ADAMTS2. 

[0033] As used herein, the term "nucleic acid" includes DNA molecules (e.g., a complementary 
DNA (cDNA) and genomic DNA (gDNA)) and RNA molecules (e.g., mRNA, rRNA, siRNA and tRNA) 
and analogs of DNA or RNA, for example, by use of nucleotide analogs. The nucleic acid molecule can 
be single-stranded and it is often double-stranded. The term "isolated or purified nucleic acid" refers to 
nucleic acids that are separated from other nucleic acids present in the natural source of the nucleic acid. 
For example, with regard to genomic DNA, the term "isolated" includes nucleic acids which are 
separated from the chromosome with which the genomic DNA is naturally associated. An "isolated" 
nucleic acid is often free of sequences which naturally flank the nucleic acid (i.e., sequences located at 
the 5' and/or 3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less 
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 5' and/or 3' nucleotide sequences which flank 
the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular 
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material, or culture medium when produced by recombinant techniques, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. As used herein, the term "gene" refers to a 
nucleotide sequence that encodes a polypeptide. 

[0034] Also included herein are nucleic acid fragments. These fragments often have a nucleotide 
sequence identical to a nucleotide sequence of SEQ ID NO: 1-3, a nucleotide sequence substantially 
identical to a nucleotide sequence of SEQ ID NO: 1 -3, or a nucleotide sequence that is complementary to 
the foregoing. The nucleic acid fragment may be identical, substantially identical or homologous to a 
nucleotide sequence in an exon or an intron in a nucleotide sequence of SEQ ID NO: 1-3, and may 
encode a domain or part of a domain of a polypeptide. Sometimes, the fragment will comprises one or 
more of the polymorphic variations described herein as being associated with osteoarthritis. The nucleic 
acid fragment is often 50, 100, or 200 or fewer base pairs in length, and is sometimes about 300, 400, 500, 
600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 2000, 3000, 4000, 5000, 10000, 15000, or 20000 
base pairs in length. A nucleic acid fragment that is complementary to a nucleotide sequence identical or 
substantially identical to a nucleotide sequence in SEQ ID NO: 1-3 and hybridizes to such a nucleotide 
sequence under stringent conditions is often referred to as a "probe." Nucleic acid fragments often include 
one or more polymorphic sites, or sometimes have an end that is adjacent to a polymorphic site as described 
hereafter. 

[0035] An example of a nucleic acid fragment is an oligonucleotide. As used herein, the term 
"oligonucleotide" refers to a nucleic acid comprising about 8 to about 50 covalently linked nucleotides, 
often comprising from about 8 to about 35 nucleotides, and more often from about 10 to about 25 
nucleotides. The backbone and nucleotides within an oligonucleotide may be the same as those of 
naturally occurring nucleic acids, or analogs or derivatives of naturally occurring nucleic acids, provided 
that oligonucleotides having such analogs or derivatives retain the ability to hybridize specifically to a 
nucleic acid comprising a targeted polymorphism. Oligonucleotides described herein may be used as 
hybridization probes or as components of prognostic or diagnostic assays, for example, as described 
herein. 

[0036] Oligonucleotides are typically synthesized using standard methods and equipment, such as 
the ABF M 3900 High Throughput DNA Synthesizer and the EXPEDITE™ 8909 Nucleic Acid 
Synthesizer, both of which are available from Applied Biosystems (Foster City, CA). Analogs and 
derivatives are exemplified in U.S. Pat. Nos. 4,469,863; 5,536,821; 5,541,306; 5,637,683; 5,637,684; 
5,700,922; 5,717,083; 5,719,262; 5,739,308; 5,773,601; 5,886,165; 5,929,226; 5,977,296; 6,140,482; 
WO 00/56746; WO 01/14398, and related publications. Methods for synthesizing oligonucleotides 
comprising such analogs or derivatives are disclosed, for example, in the patent publications cited above 
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and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955,599; 5,962,674; 6,1 17,992; in WO 00/75372; and in 
related publications. 

(0037) Oligonucleotides may also be linked to a second moiety. The second moiety may be an 
additional nucleotide sequence such as a tail sequence (eg, a polyadenosine tail), an adapter sequence 
(e.g., phage M 13 universal tail sequence), and others. Alternatively, the second moiety may be a non- 
nucleotide moiety such as a moiety which facilitates linkage to a solid support or a label to facilitate 
detection of the oligonucleotide. Such labels include, without limitation, a radioactive label, a 
fluorescent label, a chemiluminescent label, a paramagnetic label, and the like. The second moiety may 
be attached to any position of the oligonucleotide, provided the oligonucleotide can hybridize to the 
nucleic acid comprising the polymorphism. 

Uses for Nucleic Acid Sequence 
[0038] Nucleic acid coding sequences may be used for diagnostic purposes for detection and control 
of polypeptide expression. Also, included herein are oligonucleotide sequences such as antisense RNA, 
small-interfering RNA (siRNA) and DNA molecules and ribozymes that function to inhibit translation of 
a polypeptide. Antisense techniques and RNA interference techniques are known in the art and are 
described herein. 

[0039] Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
hammerhead motif ribozyme molecules may be engineered that specifically and efficiently catalyze 
endonucleolytic cleavage of RNA sequences corresponding to or complementary to ADAMTS2 
nucleotide sequences. Specific ribozyme cleavage sites within any potential RNA target are initially 
identified by scanning the target molecule for ribozyme cleavage sites which include the following 
sequences, GUA, GUU and GUC. Once identified, short RNA sequences of between fifteen (15) and 
twenty (20) ribonucleotides corresponding to the region of the target gene containing the cleavage site 
may be evaluated for predicted structural features such as secondary structure that may render the 
oligonucleotide sequence unsuitable. The suitability of candidate targets may also be evaluated by testing 
their accessibility to hybridization with complementary oligonucleotides, using ribonuclease protection 
assays. 

[0040] Antisense RNA and DNA molecules, siRNA and ribozymes may be prepared by any method 
known in the art for the synthesis of RNA molecules. These include techniques for chemically 
synthesizing oligodeoxyribonucleotides well known in the art such as solid phase phosphoramidite 
chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo transcription 
of DNA sequences encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
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into a wide variety of vectors which incorporate suitable RNA polymerase promoters such as the T7 or 
SP6 polymerase promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into cell lines. 

[0041] DNA encoding a polypeptide also may have a number of uses for the diagnosis of diseases, 
including osteoarthritis, resulting from aberrant expression of a target gene described herein. For 
example, the nucleic acid sequence may be used in hybridization assays of biopsies or autopsies to 
diagnose abnormalities of expression or function (e.g. , Southern or Northern blot analysis, in situ 
hybridization assays). 

[0042] In addition, the expression of a polypeptide during embryonic development may also be 
determined using nucleic acid encoding the polypeptide. As addressed, infra, production of functionally 
impaired polypeptide is the cause of various disease states, such as osteoarthritis. In situ hybridizations 
using polypeptide as a probe may be employed to predict problems related to osteoarthritis. Further, as 
indicated, infra, administration of human active polypeptide, recombinantly produced as described 
herein, may be used to treat disease states related to functionally impaired polypeptide. Alternatively, 
gene therapy approaches may be employed to remedy deficiencies of functional polypeptide or to replace 
or compete with dysfunctional polypeptide. 

Expression Vectors. Host Cells, and Genetically Engineered Cells 
[0043] Provided herein are nucleic acid vectors, often expression vectors, which contain a 
ADAMTS2 nucleotide sequence, or a substantially identical sequence thereof. As used herein, the term 
"vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has 
been linked and can include a plasmid, cosmid, or viral vector. The vector can be capable of autonomous 
replication or it can integrate into a host DNA. Viral vectors may include replication defective 
retroviruses, adenoviruses and adeno-associated viruses for example. 

[0044] A vector can include a ADAMTS2 nucleotide sequence in a form suitable for expression of an 
encoded target polypeptide or target nucleic acid in a host cell. A "target polypeptide" is a polypeptide 
encoded by a ADAMTS2 nucleotide sequence, or a substantially identical nucleotide sequence thereof. 
The recombinant expression vector typically includes one or more regulatory sequences operatively 
linked to the nucleic acid sequence to be expressed. The term "regulatory sequence" includes promoters, 
enhancers and other expression control elements (e.g., polyadenylation signals). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence, as well as tissue-specific 
regulatory and/or inducible sequences. The design of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level of expression of polypeptide desired, and the like. 
Expression vectors can be introduced into host cells to produce target polypeptides, including fusion 
polypeptides. 
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[0045] Recombinant expression vectors can be designed for expression of target polypeptides in 
prokaryotic or eukaryotic cells. For example, target polypeptides can be expressed in E. coli, insect cells 
(e.g., using baculovirus expression vectors), yeast cells, or mammalian cells. Suitable host cells are 
discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

[0046] Expression of polypeptides in prokaryotes is most often carried out in E. coli with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or non-fusion 
polypeptides. Fusion vectors add a number of amino acids to a polypeptide encoded therein, usually to 
the amino terminus of the recombinant polypeptide. Such fusion vectors typically serve three purposes: 
1) to increase expression of recombinant polypeptide; 2) to increase the solubility of the recombinant 
polypeptide; and 3) to aid in the purification of the recombinant polypeptide by acting as a ligand in 
affinity purification. Often, a proteolytic cleavage site is introduced at the junction of the fusion moiety 
and the recombinant polypeptide to enable separation of the recombinant polypeptide from the fusion 
moiety subsequent to purification of the fusion polypeptide. Such enzymes, and their cognate 
recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors 
include pGEX (Pharmacia Biotech Inc; Smith & Johnson, Gene 67: 31-40 (1988)), pMAL (New England 
Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase 
(GST), maltose E binding polypeptide, or polypeptide A, respectively, to the target recombinant 
polypeptide. 

[0047] Purified fusion polypeptides can be used in screening assays and to generate antibodies 
specific for target polypeptides. In a therapeutic embodiment, fusion polypeptide expressed in a 
retroviral expression vector is used to infect bone marrow cells that are subsequently transplanted into 
irradiated recipients. The pathology of the subject recipient is then examined after sufficient time has 
passed (e.g., six (6) weeks). 

[0048] Expressing the polypeptide in host bacteria with an impaired capacity to proteolytically 
cleave the recombinant polypeptide is often used to maximize recombinant polypeptide expression 
(Gottesman, S., Gene Expression Technology: Methods in Enzymology, Academic Press, San Diego, 
California 185: 1 19-128 (1990)). Another strategy is to alter the nucleotide sequence of the nucleic acid 
to be inserted into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli - (Wada et al.. Nucleic Acids Res. 20:2111-2118(1992)). Such alteration 
of nucleotide sequences can be carried out by standard DNA synthesis techniques. 

[0049] When used in mammalian cells, the expression vector's control functions are often provided 
by viral regulatory elements. For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus and Simian Virus 40. Recombinant mammalian expression vectors are 
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often capable of directing expression of the nucleic acid in a particular cell type (e.g., tissue-specific 
regulatory elements are used to express the nucleic acid). Non-limiting examples of suitable tissue- 
specific promoters include an albumin promoter (liver-specific; Pinkert et al. Genes Dev. 1: 268-277 
(1987)), lymphoid-specific promoters (Calame & Eaton, Adv. Immunol. 43: 235-275 (1988)), promoters 
of T cell receptors (Winoto & Baltimore, EMBO J. 8: 729-733 (1989)) promoters of immunoglobulins 
(Banerji et al., Cell 33: 129-1 AO (1983); Queen & Baltimore, Cell 33: 741-748 (1983)), neuron-specific 
promoters (e.g., the neurofilament promoter; Byrne & Ruddle, Proc. Natl. Acad. Sci. USA 86: 5473- 
5477 (1989)), pancreas-specific promoters (Edlund et al.. Science 230: 912-916 (1985)), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Developmentalfy-regulated promoters are sometimes utilized, for 
example, the murine hox promoters (Kessel & Gruss, Science 249: 374-379 (1990)) and the ot- 
fetopolypeptide promoter (Campes & Tilghman, Genes Dev. 3: 537-546 (1989)). 

[0050J A ADAMTS2 nucleic acid also may be cloned into an expression vector in an antisense 
orientation. Regulatory sequences (e.g., viral promoters and/or enhancers) operatively linked to a 
ADAMTS2 nucleic acid cloned in the antisense orientation can be chosen for directing constitutive, tissue 
specific or cell type specific expression of antisense RNA in a variety of cell types. Antisense expression 
vectors can be in the form of a recombinant plasmid, phagemid or attenuated virus. For a discussion of 
the regulation of gene expression using antisense genes see, e.g., Weintraub et al., Antisense RNA as a 
molecular tool for genetic analysis, Reviews - Trends in Genetics, Vol. 1(1) (1986). 

[0051] Also provided herein are host cells that include a ADAMTS2 nucleotide sequence within a 
recombinant expression vector or a fragment of such a nucleotide sequence which facilitate homologous 
recombination into a specific site of the host cell genome. The terms "host cell" and "recombinant host 
cell" are used interchangeably herein. Such terms refer not only to the particular subject cell but rather 
also to the progeny or potential progeny of such a cell. Because certain modifications may occur in 
succeeding generations due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the term as used herein. A 
host cell can be any prokaryotic or eukaryotic cell. For example, a target polypeptide can be expressed in 
bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells 
(CHO) or COS cells). Other suitable host cells are known to those skilled in the art. 

[0052] Vectors can be introduced into host cells via conventional transformation or transfection 
techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a 
variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co-precipitation, transduction/infection, DEAE-dextran- 
mediated transfection, lipofection, or electroporation. 
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[0053] A host cell provided herein can be used to produce (i.e., express) a target polypeptide or a 
substantially identical polypeptide thereof. Accordingly, further provided are methods for producing a 
target polypeptide using host cells described herein. In one embodiment, the method includes culturing 
host cells into which a recombinant expression vector encoding a target polypeptide has been introduced 
in a suitable medium such that a target polypeptide is produced. In another embodiment, the method 
further includes isolating a target polypeptide from the medium or the host cell. 

[0054] Also provided are cells or purified preparations of cells which include a ADAMTS2 
transgene, or which otherwise misexpress target polypeptide. Cell preparations can consist of human or 
non-human cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit cells, or pig cells. In preferred 
embodiments, the cell or cells include a ADAMTS2 transgene (e.g., a heterologous form of a ADAMTS2 
gene, such as a human gene expressed in non-human cells). The transgene can be misexpressed, e.g., 
overexpressed or underexpressed. In other preferred embodiments, the cell or cells include a gene which 
misexpress an endogenous target polypeptide (e.g., expression of a gene is disrupted, also known as a 
knockout). Such cells can serve as a model for studying disorders which are related to mutated or mis- 
expressed alleles or for use in drug screening. Also provided are human cells (e.g., a hematopoietic stem 
cells) transfected with a ADAMTS2 nucleic acid. 

[0055] Also provided are cells or a purified preparation thereof (e.g., human cells) in which an 
endogenous ADAMTS2 nucleic acid is under the control of a regulatory sequence that does not normally 
control the expression of the endogenous gene. The expression characteristics of an endogenous gene 
within a cell (e.g., a cell line or microorganism) can be modified by inserting a heterologous DNA 
regulatory element into the genome of the cell such that the inserted regulatory element is operably 
linked to the corresponding endogenous gene. For example, an endogenous corresponding gene (e.g., a 
gene which is "transcriptionally silent," not normally expressed, or expressed only at very low levels) 
may be activated by inserting a regulatory element which is capable of promoting the expression of a 
normally expressed gene product in that cell. Techniques such as targeted homologous recombinations, 
can be used to insert the heterologous DNA as described in, e.g., Chappel, US 5,272,071; WO 91/06667, 
published on May 16, 1991. 

Transgenic Animals 

[0056] Non-human transgenic animals that express a heterologous target polypeptide (e.g., 
expressed from a ADAMTS2 nucleic acid or substantially identical sequence thereof) can be generated. 
Such animals are useful for studying the function and/or activity of a target polypeptide and for 
identifying and/or evaluating modulators of the activity of ADAMTS2 nucleic acids and encoded 
polypeptides. As used herein, a "transgenic animal" is a non-human animal such as a mammal (e.g. , a 
non-human primate such as chimpanzee, baboon, or macaque; an ungulate such as an equine, bovine, or 
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caprine; or a rodent such as a rat, a mouse, or an Israeli sand rat), a bird (e.g., a chicken or a turkey), an 
amphibian (e.g., a frog, salamander, or newt), or an insect (e.g., Drosophila melanogaster), in which one 
or more of the cells of the animal includes a transgene. A transgene is exogenous DNA or a 
rearrangement (e.g., a deletion of endogenous chromosomal DNA) that is often integrated into or occurs 
in the genome of cells in a transgenic animal. A transgene can direct expression of an encoded gene 
product in one or more cell types or tissues of the transgenic animal, and other transgenes can reduce 
expression (e.g., a knockout). Thus, a transgenic animal can be one in which an endogenous nucleic acid 
homologous to a ADAMTS2 nucleic acid has been altered by homologous recombination between the 
endogenous gene and an exogenous DNA molecule introduced into a cell of the animal (e.g., an 
embryonic cell of the animal) prior to development of the animal. 

[0057] Intronic sequences and polyadenylation signals can also be included in the transgene to 
increase expression efficiency of the transgene. One or more tissue-specific regulatory sequences can be 
operably linked to a ADAMTS2 nucleotide sequence to direct expression of an encoded polypeptide to 
particular cells. A transgenic founder animal can be identified based upon the presence of a ADAMTS2 
nucleotide sequence in its genome and/or expression of encoded mRNA in tissues or cells of the animals. 
A transgenic founder animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a ADAMTS2 nucleotide sequence can further be bred to other 
transgenic animals carrying other transgenes. 

[0058] Target polypeptides can be expressed in transgenic animals or plants by introducing, for 
example, a ADAMTS2 nucleic acid into the genome of an animal that encodes the target polypeptide. In 
preferred embodiments the nucleic acid is placed under the control of a tissue specific promoter, e.g., a 
milk or egg specific promoter, and recovered from the milk or eggs produced by the animal. Also 
included is a population of cells from a transgenic animal. 

Tar get Polypeptides 

[0059] Also featured herein are isolated target polypeptides, which are encoded by a ADAMTS2 
nucleotide sequence (e.g., SEQ ID NO: 1-3), or a substantially identical nucleotide sequence thereof. 
ADAMTS2 exists in two forms, a "long" form comprising a molecule approximately 130 kDa in length 
(e.g., SEQ ID NO: 4), and a "short" form comprising a molecule approximately 70 kDa in length (e.g., 
SEQ ID NO: 5). Thus featured herein are isolated ADAMTS2 polypeptides, which include long and short 
isoforms, and substantially identical polypeptides thereof. An ADAMTS2 polypeptide is a polypeptide 
encoded by an ADAMTS2 nucleic acid, where one nucleic acid can encode one or more different 
polypeptides. The term "polypeptide" as used herein includes proteins and peptides. An "isolated" or 
"purified" polypeptide or protein is substantially free of cellular material or other contaminating proteins 
from the cell or tissue source from which the protein is derived, or substantially free from chemical 
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precursors or other chemicals when chemically synthesized. In one embodiment, the language 
"substantially free" means preparation of a target polypeptide having less than about 30%, 20%, 10% and 
more preferably 5% (by dry weight), of non-target polypeptide (also referred to herein as a 
"contaminating protein"), or of chemical precursors or non-target chemicals. When the target 
polypeptide or a biologically active portion thereof is recombinantly produced, it is also preferably 
substantially free of culture medium, specifically, where culture medium represents less than about 20%, 
sometimes less than about 10%, and often less than about 5% of the volume of the polypeptide 
preparation. Isolated or purified target polypeptide preparations are sometimes 0.01 milligrams or more 
or 0.1 milligrams or more, and often 1 .0 milligrams or more and 10 milligrams or more in dry weight. 

[0060] Further included herein are target polypeptide fragments. The polypeptide fragment may be 
a domain or part of a domain of a target polypeptide. The polypeptide fragment may have increased, 
decreased or unexpected biological activity. The polypeptide fragment is often 50 or fewer, 100 or fewer, 
or 200 or fewer amino acids in length, and is sometimes 300, 400, 500, 600, 700, or 900 or fewer amino 
acids in length. Thus, featured herein are ADAMTS2 polypeptides and biologically active or antigenic 
fragments thereof useful as reagents or targets in assays applicable to treatment and diagnosis of 
osteoarthritis. In another embodiment, provided herein are ADAMTS2 polypeptides having a ADAMTS2 
activity (e.g., a zinc binding activity, a metalloprotease activity, a procollagen II processing or synthesis 
activity, or a collagen II synthesis activity in vitro or in vivo). In certain embodiments, the polypeptides 
are ADAMTS2 proteins including at least one propeptide domain, at least one metalloproteinase domain, 
at least one disintegrin-like domain, at least one, two, three, and often four thrombospondin domains, and 
sometimes having a ADAMTS2 activity, e.g., a ADAMTS2 activity as described herein. ADAMTS2 
polypeptides and fragments thereof often have biological activity, such as excising the N-propeptide of 
type II procollagens. Methods for monitoring and quantifying this biological activity are known (e.g., 
Colige et al., J. Biol. Chem. 270: 16724-16730 (1995)). 

[0061] Human ADAMTS2 protein (SEQ ID NO: 4-5) includes a signal sequence of about 29 amino 
acids (from amino acid 1 to about amino acid 29 of SEQ ID NO: 4-5). The ADAMTS2 protein without 
the signal sequence can be approximately 1 1 82 amino acid residues in length (from about amino acid 30 
to amino acid 1211 of SEQ ID NO: 4) or approximately 485 amino acid residues in length (from about 
amino acid 30 to amino acid 514 of SEQ ID NO: 5). Human ADAMTS2 protein includes a "pro" region 
homologous to the reprolysin family propeptide, which is typically post-translationally cleaved upon 
conversion of the inactive (or pro-domain containing) protein to the catalytically active metalloprotease. 
The prodomain region of human ADAMTS2 protein corresponds to about amino acids 30 to 25 1 , 30 to 
252, 30 to 253, 30 to 254, 30 to 255, 30 to 256, 30 to 257, 30 to 258 or 30 to 259 of SEQ ID NO: 4-5, 
where it is understood that the active form of ADAMTS2 does not contain the propeptide domain. 
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[0062] Upon cleavage, catalytically active mature protein can be approximately 960, 959, 958, 957, 
956, 955, 954, 953 or 952 amino acids in length (from about amino acid 252, 253, 254, 255, 256, 257, 
258, 259 or 260 to amino acid 1211 of SEQ ID NO: 4) or approximately 261, 260, 259, 258, 257, 256, 
255, 254 or 253 amino acid residues in length (from about amino acid 252, 253, 254, 255, 256, 257, 258, 
259 or 260 to amino acid 514 of SEQ ID NO: 5). 

[0063] Human ADAMTS2 contains the following regions or other structural features: a signal 
sequence at about amino acids 1-29 of SEQ ID NO: 4-5; a reprolysin family propeptide domain located at 
about amino acid residues 30 to 251, 30 to 252, 30 to 253, 30 to 254, 30 to 255, 30 to 256, 30 to 257, 30 
to 258 or 30 to 259 of SEQ ID NO: 4-5; a zinc-metalloprotease catalytic domain at about amino acids 
251 to 479, 252 to 479, 253 to 479, 254 to 479, 255 to 479, 256 to 479, 257 to 479, 258 to 479 or 259 to 
479 of SEQ ED NO: 4-5; a disintegrin domain at about amino acids 480 to 560 of SEQ ED NO: 4; a 
cysteine-rich domain at about amino acids 618 to 722 of SEQ ED NO: 4; four thrombospondin motifs-2 
motifs at about amino acids 561 to 616, 854 to 912, 914 to 971, and 975 to 1029 of SEQ ED NO: 4; and 
eight N-glycosylation sites located at about amino acids 1 12, 251, 949, 993, 1031, 1098, 1 145, and 1 150 
of SEQ ED NO: 4. 

[0064] In other embodiments, provided are methods of increasing the synthesis of procollagen II 
comprising providing or administering to individuals in need of increasing levels of type II collagen the 
pharmaceutical or physiologically acceptable composition comprising active human ADAMTS2 protein 
or fragment thereof, where ADAMTS2 polypeptide fragments having activity are selected from amino 
acids 252-1211, 253-121 1, 254-121 1, 255-121 1, 256-121 1, 257-121 1, 258-121 1, 259-121 1 or 260-1211 
of SEQ ID NO: 4, where it is understood that the active form of ADAMTS2 does not contain the 
propeptide domain. 

[0065] Substantially identical target polypeptides may depart from the amino acid sequences of 
target polypeptides in different manners. For example, conservative amino acid modifications may be 
introduced at one or more positions in the amino acid sequences of target polypeptides. A "conservative 
amino acid substitution" is one in which the amino acid is replaced by another amino acid having a 
similar structure and/or chemical function. Families of amino acid residues having similar structures and 
functions are well known. These families include amino acids with basic side chains {e.g., lysine, 
arginine, histidine), acidic side chains {e.g. , aspartic acid, glutamic acid), uncharged polar side chains 
(e.g. , glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains {e.g. , 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains {e.g., threonine, valine, isoleucine) and aromatic side chains {e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Also, essential and non-essential amino acids may be replaced. A "non-essential" 
amino acid is one that can be altered without abolishing or substantially altering the biological function 
of a target polypeptide, whereas altering an "essential" amino acid abolishes or substantially alters the 
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biological function of a target polypeptide. Amino acids that are conserved among target polypeptides 
are typically essential amino acids. In certain embodiments, the polypeptide includes one or more non- 
synonymous polymorphic variants associated with osteoarthritis. 

[00661 Also, target polypeptides may exist as chimeric or fusion polypeptides. As used herein, a 
target "chimeric polypeptide" or target "fusion polypeptide" includes a target polypeptide linked to a 
non-target polypeptide. A "non-target polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a polypeptide which is not substantially identical to the target polypeptide, 
which includes, for example, a polypeptide that is different from the target polypeptide and derived from 
the same or a different organism. The target polypeptide in the fusion polypeptide can correspond to an 
entire or nearly entire target polypeptide or a fragment thereof. The non-target polypeptide can be fused 
to the N-terminus or C-terminus of the target polypeptide. 

[0067] Fusion polypeptides can include a moiety having high affinity for a ligand. For example, the 
fusion polypeptide can be a GST-target fusion polypeptide in which the target sequences are fused to the 
C-terminus of the GST sequences, or a polyhistidine-target fusion polypeptide in which the target 
polypeptide is fused at the N- or C-terminus to a string of histidine residues. Such fusion polypeptides 
can facilitate purification of recombinant target polypeptide. Expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide), and a nucleotide sequence in 
SEQ ID NO: 1-3, or a substantially identical nucleotide sequence thereof, can be cloned into an 
expression vector such that the fusion moiety is linked in-frame to the target polypeptide. Further, the 
fusion polypeptide can be a target polypeptide containing a heterologous signal sequence at its N- 
terminus. In certain host cells (e.g., mammalian host cells), expression, secretion, cellular internalization, 
and cellular localization of a target polypeptide can be increased through use of a heterologous signal 
sequence. Fusion polypeptides can also include all or a part of a serum polypeptide (e.g., an IgG constant 
region or human serum albumin). 

[0068] Target polypeptides can be incorporated into pharmaceutical compositions and administered 
to a subject in vivo. Administration of these target polypeptides can be used to affect the bioavailability 
of a substrate of the target polypeptide and may effectively increase target polypeptide biological activity 
in a cell. Target fusion polypeptides may be useful therapeutically for the treatment of disorders caused 
by, for example, (i) aberrant modification or mutation of a gene encoding a target polypeptide; (ii) mis- 
regulation of the gene encoding the target polypeptide; and (iii) aberrant post-translational modification 
of a target polypeptide. Also, target polypeptides can be used as immunogens to produce anti-target 
antibodies in a subject, to purify target polypeptide ligands or binding partners, and in screening assays to 
identify molecules which inhibit or enhance the interaction of a target polypeptide with a substrate. 

[0069] In addition, polypeptides can be chemically synthesized using techniques known in the art 
(See, e.g., Creighton, 1983 Proteins. New York, N.Y.: W. H. Freeman and Company; and Hunkapiller et 
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al., (1984) Nature July 12 -1 8;3 10(5973): 1 05-1 1). For example, a relative short fragment can be 
synthesized by use of a peptide synthesizer. Furthermore, if desired, non-classical amino acids or 
chemical amino acid analogs can be introduced as a substitution or addition into the fragment sequence. 
Non-classical amino acids include, but are not limited to, to the D-isomers of the common amino acids, 
2,4-diaminobutyric acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g- 
Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, 
norleucme, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoroamino acids, designer amino acids such 
as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid analogs in 
general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary). 

[0070] Polypeptides and polypeptide fragments sometimes are differentially modified during or after 
translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 
protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, 
etc. Any of numerous chemical modifications may be earned out by known techniques, including but not 
limited, to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, 
NaBH4; acetylation, formylation, oxidation, reduction; metabolic synthesis in the presence of 
tunicamycin; and the like. Additional post-translational modifications include, for example, N-linked or 
O-linked carbohydrate chains, processing of N-terminal or C-terminal ends), attachment of chemical 
moieties to the amino acid backbone, chemical modifications of N-linked or O-linked carbohydrate 
chains, and addition or deletion of an N-terminal methionine residue as a result of prokaryotic host cell 
expression. The polypeptide fragments may also be modified with a detectable label, such as an 
enzymatic, fluorescent, isotopic or affinity label to allow for detection and isolation of the polypeptide. 

[0071] Also provided are chemically modified derivatives of polypeptides that can provide 
additional advantages such as increased solubility, stability and circulating time of the polypeptide, or 
decreased immunogenicity (see e.g., U.S. Pat. No: 4,179,337. The chemical moieties for derivitization 
may be selected from water soluble polymers such as polyethylene glycol, ethylene glycol/propylene 
glycol copolymers, carboxymefhylcellulose, dextran, polyvinyl alcohol and the like. The polypeptides 
may be modified at random positions within the molecule, or at predetermined positions within the 
molecule and may include one, two, three or more attached chemical moieties. 

[0072] The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 100 kDa (the term 
"about" indicating that in preparations of polyethylene glycol, some molecules will weigh more, some 
less, than the stated molecular weight) for ease in handling and manufacturing. Other sizes may be used, 
depending on the desired therapeutic profile (e.g., the duration of sustained release desired, the effects, if 
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any on biological activity, the ease in handling, the degree or lack of antigenicity and other known effects 
of the polyethylene glycol to a therapeutic protein or analog). 

[0073] The polymers should be attached to the polypeptide with consideration of effects on 
functional or antigenic domains of the polypeptide. There are a number of attachment methods available 
to those skilled in the art (e.g., EP 0 401 384 (coupling PEG to G-CSF) and Malik et al. (1992) Exp 
Hematol. September;20(8): 1028-35 (pegylation of GM-CSF using tresyl chloride)). For example, 
polyethylene glycol may be covalently bound through amino acid residues via a reactive group, such as a 
free amino or carboxyl group. Reactive groups are those to which an activated polyethylene glycol 
molecule may be bound. The amino acid residues having a free amino group may include lysine residues 
and the N-terminal amino acid residues; those having a free carboxyl group may include aspartic acid 
residues, glutamic acid residues and the C-terminal amino acid residue. Sulfhydryl groups may also be 
used as a reactive group for attaching the polyethylene glycol molecules. For therapeutic purposes, the 
attachment sometimes is at an amino group, such as attachment at the N-terminus or lysine group. 

[0074] Proteins can be chemically modified at the N-terminus. Using polyethylene glycol as an 
illustration of such a composition, one may select from a variety of polyethylene glycol molecules (by 
molecular weight, branching, and the like), the proportion of polyethylene glycol molecules to protein 
(polypeptide) molecules in the reaction mix, the type of pegylation reaction to be performed, and the 
method of obtaining the selected N-terminally pegylated protein. The method of obtaining the N- 
terminally pegylated preparation (i.e., separating this moiety from other monopegylated moieties if 
necessary) may be by purification of the N-terminally pegylated material from a population of pegylated 
protein molecules. Selective proteins chemically modified at the N-terminus may be accomplished by 
reductive alkylation, which exploits differential reactivity of different types of primary amino groups 
(lysine versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus with a carbonyl 
group containing polymer is achieved. 

Substantially Identical Nucleic Acids and Polypeptides 
[0075] Nucleotide sequences and polypeptide sequences that are substantially identical to a 
ADAMTS2 nucleotide sequence and the target polypeptide sequences encoded by those nucleotide 
sequences, respectively, are included herein. The term "substantially identical" as used herein refers to 
two or more nucleic acids or polypeptides sharing one or more identical nucleotide sequences or 
polypeptide sequences, respectively. Included are nucleotide sequences or polypeptide sequences that 
are 55% or more, 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 
90% or more, 95% or more (each often within a 1%, 2%, 3% or 4% variability) identical to a ADAMTS2 
nucleotide sequence or the encoded target polypeptide amino acid sequences. One test for determining 
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whether two nucleic acids are substantially identical is to determine the percent of identical nucleotide 
sequences or polypeptide sequences shared between the nucleic acids or polypeptides. 

[0076] Calculations of sequence identity are often performed as follows. Sequences are aligned for 
optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino 
acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded 
for comparison purposes). The length of a reference sequence aligned for comparison purposes is 
sometimes 30% or more, 40% or more, 50% or more, often 60% or more, and more often 70% or more, 
80% or more, 90% or more, or 100% of the length of the reference sequence. The nucleotides or amino 
acids at corresponding nucleotide or polypeptide positions, respectively, are then compared among the 
two sequences. When a position in the first sequence is occupied by the same nucleotide or amino acid 
as the corresponding position in the second sequence, the nucleotides or amino acids are deemed to be 
identical at that position. The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences, taking into account the number of gaps, and the length of 
each gap, introduced for optimal alignment of the two sequences. 

[00771 Comparison of sequences and determination of percent identity between two sequences can 
be accomplished using a mathematical algorithm. Percent identity between two amino acid or nucleotide 
sequences can be determined using the algorithm of Meyers & Miller, CABIOS 4: 11-17 (1989), which 
has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a 
gap length penalty of 12 and a gap penalty of 4. Also, percent identity between two amino acid 
sequences can be determined using the Needleman & Wunsch, /. Mol. Biol. 48: 444-453 (1970) 
algorithm which has been incorporated into the GAP program in the GCG software package (available at 
the http address www.gcg.com), using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight 
of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. Percent identity between two 
nucleotide sequences can be determined using the GAP program in the GCG software package (available 
at http address www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 
80 and a length weight of 1, 2, 3, 4, 5, or 6. A set of parameters often used is a Blossum 62 scoring 
matrix with a gap open penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. 

[0078] Another manner for determining if two nucleic acids are substantially identical is to assess 
whether a polynucleotide homologous to one nucleic acid will hybridize to the other nucleic acid under 
stringent conditions. As use herein, the term "stringent conditions" refers to conditions for hybridization 
and washing. Stringent conditions are known to those skilled in the art and can be found in Current 
Protocols in Molecular Biology, John Wiley & Sons, N.Y. , 6.3. 1 -6.3.6 (1 989). Aqueous and non- 
aqueous methods are described in that reference and either can be used. An example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
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followed by one or more washes in 0.2X SSC, 0.1% SDS at 50°C. Another example of stringent 
hybridization conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 55°C. A further example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 60°C. Often, stringent hybridization 
conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one 
or more washes in 0.2X SSC, 0.1% SDS at 65°C. More often, stringency conditions are 0.5M sodium 
phosphate, 7% SDS at 65°C, followed by one or more washes at 0.2X SSC, 1% SDS at 65°C. 

[0079] An example of a substantially identical nucleotide sequence to a nucleotide sequence in SEQ 
ID NO: 1-3 is one that has a different nucleotide sequence but still encodes the same polypeptide 
sequence encoded by the nucleotide sequence in SEQ ID NO: 1-3. Another example is a nucleotide 
sequence that encodes a polypeptide having a polypeptide sequence that is more than 70% or more 
identical to, sometimes more than 75% or more, 80% or more, or 85% or more identical to, and often 
more than 90% or more and 95% or more identical to a polypeptide sequence encoded by a nucleotide 
sequence in SEQ ID NO: 1-3. 

[0080] Nucleotide sequences in SEQ ID NO: 1-3 and amino acid sequences of encoded polypeptides 
can be used as "query sequences" to perform a search against public databases to identify other family 
members or related sequences, for example. Such searches can be performed using the NBLAST and 
XBLAST programs (version 2.0) of Altschul et al, J. Mol. Biol. 215: 403-10 (1990). BLAST nucleotide 
searches can be performed with the NBLAST program, score — 100, wordlength = 12 to obtain 
nucleotide sequences homologous to nucleotide sequences in SEQ ID NO: 1-3. BLAST polypeptide 
searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to polypeptides encoded by the nucleotide sequences of SEQ ID NO: 1-3. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al, Nucleic Acids Res. 25(1 7): 3389-3402 (1997). When utilizing BLAST and Gapped 
BLAST programs, default parameters of the respective programs (e.g. , XBLAST and NBLAST) can be 
used (see the http address www.ncbi.nlm.nih.gov). 

[0081] A nucleic acid that is substantially identical to a nucleotide sequence in SEQ ID NO: 1-3 
may include polymorphic sites at positions equivalent to those described herein when the sequences are 
aligned. For example, using the alignment procedures described herein, SNPs in a sequence substantially 
identical to a sequence in SEQ ID NO: 1-3 can be identified at nucleotide positions that match (i.e., align) 
with nucleotides at SNP positions in each nucleotide sequence in SEQ ID NO: 1-3. Also, where a 
polymorphic variation results in an insertion or deletion, insertion or deletion of a nucleotide sequence 
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from a reference sequence can change the relative positions of other polymorphic sites in the nucleotide 
sequence. 

[0082] Substantially identical nucleotide and polypeptide sequences include those that are naturally 
occurring, such as allelic variants (same locus), splice variants, homologs (different locus), and orthologs 
(different organism) or can be non-naturally occurring. Non-naturally occurring variants can be 
generated by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms. 
The variants can contain nucleotide substitutions, deletions, inversions and insertions. Variation can 
occur in either or both the coding and non-coding regions. The variations can produce both conservative 
and non-conservative amino acid substitutions (as compared in the encoded product). Orthologs, 
homologs, allelic variants, and splice variants can be identified using methods known in the art. These 
variants normally comprise a nucleotide sequence encoding a polypeptide that is 50% or more, about 
55% or more, often about 70-75% or more or about 80-85% or more, and sometimes about 90-95% or 
more identical to the amino acid sequences of target polypeptides or a fragment thereof. Such nucleic 
acid molecules can readily be identified as being able to hybridize under stringent conditions to a 
nucleotide sequence in SEQ ID NO: 1-3 or a fragment of this sequence. Nucleic acid molecules 
corresponding to orthologs, homologs, and allelic variants of a nucleotide sequence in SEQ ID NO: 1-3 
can further be identified by mapping the sequence to the same chromosome or locus as the nucleotide 
sequence in SEQ ID NO: 1-3. 

[0083] Also, substantially identical nucleotide sequences may include codons that are altered with 
respect to the naturally occurring sequence for enhancing expression of a target polypeptide in a 
particular expression system. For example, the nucleic acid can be one in which one or more codons are 
altered, and often 10% or more or 20% or more of the codons are altered for optimized expression in 
bacteria (e.g., E. coli.\ yeast (e.g., S. cervesiae), human (e.g., 293 cells), insect, or rodent (e.g., hamster) 
cells. 

Methods for Identifying Risk of Osteoarthritis 
[0084] Methods for prognosing and diagnosing osteoarthritis are included herein. These methods 
include detecting the presence or absence of one or more polymorphic variations in a nucleotide sequence 
associated with osteoarthritis, such as variants in or around the loci set forth herein, or a substantially 
identical sequence thereof, in a sample from a subject, where the presence of a polymorphic variant 
described herein is indicative of a risk of osteoarthritis. Determining a risk of osteoarthritis sometimes 
refers to determining whether an individual is at an increased risk of osteoarthritis (e.g., intermediate risk 
or higher risk). 

[0085] Thus, featured herein is a method for identifying a subject who is at risk of osteoarthritis, 
which comprises detecting an aberration associated with osteoarthritis in a nucleic acid sample from the 
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subject. An embodiment is a method for detecting a risk of osteoarthritis in a subject, which comprises 
detecting the presence or absence of a polymorphic variation associated with osteoarthritis at a 
polymorphic site in a nucleotide sequence in a nucleic acid sample from a subject, where the nucleotide 
sequence comprises a polynucleotide sequence selected from the group consisting of: (a) a nucleotide 
sequence of SEQ ID NO: 1-3; (b) a nucleotide sequence which encodes a polypeptide consisting of an 
amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-3; (c) a nucleotide sequence 
which encodes a polypeptide that is 90% or more identical to an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1-3, or a nucleotide sequence about 90% or more identical to a 
nucleotide sequence of SEQ ID NO: 1-3; and (d) a fragment of a nucleotide sequence of (a), (b), or (c) 
comprising the polymorphic site; whereby the presence of the polymorphic variation is indicative of a 
predisposition to osteoarthritis in the subject. In some embodiments, a polymorphic variation at position 
733 of SEQ ID NO: 2-3 may be detected (e.g., a guanine in these sequences or a cytosine in a 
complementary sequence are associated with increased risk of osteoarthritis). In certain embodiments, 
polymorphic variants at the positions described herein are detected for determining a risk of 
osteoarthritis, and polymorphic variants at positions in linkage disequilibrium with these positions are 
detected for determining a risk of osteoarthritis. As used herein, "SEQ ID NO: 1 -3" refers to individual 
sequences in SEQ ID NO: 1, 2 or 3, each sequence being separately applicable to embodiments described 
herein. 

[0086] Risk of osteoarthritis sometimes is expressed as a probability, such as an odds ratio, 
percentage, or risk factor. Risk often is based upon the presence or absence of one or more polymorphic 
variants described herein, and also may be based in part upon phenotypic traits of the individual being 
tested. Methods for calculating risk based upon patient data are well known (see, e.g., Agresti, 
Categorical Data Analysis, 2nd Ed. 2002. Wiley). Alleloryping and genotyping analyses may be carried 
out in populations other than those exemplified herein to enhance the predictive power of the prognostic 
method. These further analyses are executed in view of the exemplified procedures described herein, and 
may be based upon the same polymorphic variations or additional polymorphic variations. 

[0087] In certain embodiments, determining the presence of a combination of two or more 
polymorphic variants associated with osteoarthritis in one or more genetic loci (e.g., one or more genes) 
of the sample is determined to identify, quantify and/or estimate, risk of osteoarthritis. The risk often is 
the probability of having or developing osteoarthritis. The risk sometimes is expressed as a relative risk 
with respect to a population average risk of osteoarthritis, and sometimes is expressed as a relative risk 
with respect to the lowest risk group. Such relative risk assessments often are based upon penetrance 
values determined by statistical methods, and are particularly useful to clinicians and insurance 
companies for assessing risk of osteoarthritis (e.g., a clinician can target appropriate detection, prevention 
and therapeutic regimens to a patient after determining the patient's risk of osteoarthritis, and an 
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insurance company can fine tune actuarial tables based upon population genotype assessments of 
osteoarthritis risk). Risk of osteoarthritis sometimes is expressed as an odds ratio, which is the odds of a 
particular person having a genotype has or will develop osteoarthritis with respect to another genotype 
group (e.g., the most disease protective genotype or population average). In related embodiments, the 
determination is utilized to identify a subject at risk of osteoarthritis. In an embodiment, two or more 
polymorphic variations are detected in two or more regions in human genomic DNA associated with 
increased risk of osteoarthritis, such as a locus containing a ADAMTS2, for example. In certain 
embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more 
polymorphic variants are detected in the sample. In specific embodiments, polymorphic variants are 
detected in a ADAMTS2 region, for example. In certain embodiments, polymorphic variants are detected 
at other genetic loci (e.g., the polymorphic variants can be detected in ADAMTS2 in addition to other loci 
or only in other loci), where the other loci include but are not limited to those described in concurrently- 
filed patent applications having attorney docket number 524593008700, 524593008800, 524593009000 
or 524593009200, which is incorporated herein by reference in its entirety. 

[0088] Results from prognostic tests may be combined with other test results to diagnose 
osteoarthritis. For example, prognostic results may be gathered, a patient sample may be ordered based 
on a determined predisposition to osteoarthritis, the patient sample is analyzed, and the results of the 
analysis may be utilized to diagnose osteoarthritis. Also osteoarthritis diagnostic method can be 
developed from studies used to generate prognostic methods in which populations are stratified into 
subpopulations having different progressions of osteoarthritis. In another embodiment, prognostic results 
may be gathered, a patient's risk factors for developing osteoarthritis (e.g., age, weight, race, diet) 
analyzed, and a patient sample may be ordered based on a determined predisposition to osteoarthritis. 

[0089] The nucleic acid sample typically is isolated from a biological sample obtained from a 
subject. For example, nucleic acid can be isolated from blood, saliva, sputum, urine, cell scrapings, and 
biopsy tissue. The nucleic acid sample can be isolated from a biological sample using standard 
techniques, such as the technique described in Example 2. As used herein, the term "subject" refers 
primarily to humans but also refers to other mammals such as dogs, cats, and ungulates (e.g., cattle, 
sheep, and swine). Subjects also include avians (e.g., chickens and turkeys), reptiles, and fish (e.g., 
salmon), as embodiments described herein can be adapted to nucleic acid samples isolated from any of 
these organisms. The nucleic acid sample may be isolated from the subject and then directly utilized in a 
method for determining the presence of a polymorphic variant, or alternatively, the sample may be 
isolated and then stored (e.g., frozen) for a period of time before being subjected to analysis. 

[0090] The presence or absence of a polymorphic variant is determined using one or both 
chromosomal complements represented in the nucleic acid sample. Determining the presence or absence 
of a polymorphic variant in both chromosomal complements represented in a nucleic acid sample from a 
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subject having a copy of each chromosome is useful for determining the zygosity of an individual for the 
polymorphic variant (i.e., whether the individual is homozygous or heterozygous for the polymorphic 
variant). Any oligonucleotide-based diagnostic may be utilized to determine whether a sample includes 
the presence or absence of a polymorphic variant in a sample. For example, primer extension methods, 
ligase sequence determination methods (e.g., U.S. Pat. Nos. 5,679,524 and 5,952,1 74, and WO 
01/27326), mismatch sequence determination methods (e.g., U.S. Pat. Nos. 5,851,770; 5,958,692; 
6,110,684; and 6,183,958), microarray sequence determination methods, restriction fragment length 
polymorphism (RFLP), single strand conformation polymorphism detection (SSCP) (e.g., U.S. Pat. Nos. 
5,891,625 and 6,013,499), PCR-based assays (e.g., TAQMAN® PCR System (Applied Biosystems)), and 
nucleotide sequencing methods may be used. 

[0091] Oligonucleotide extension methods typically involve providing a pair of oligonucleotide 
primers in a polymerase chain reaction (PCR) or in other nucleic acid amplification methods for the 
purpose of amplifying a region from the nucleic acid sample that comprises the polymorphic variation. 
One oligonucleotide primer is complementary to a region 3' of the polymorphism and the other is 
complementary to a region 5' of the polymorphism. A PCR primer pair may be used in methods 
disclosed in U.S. Pat. Nos. 4,683,195; 4,683,202, 4,965,188; 5,656,493; 5,998,143; 6,140,054; WO 
01/27327; and WO 01/27329 for example. PCR primer pairs may also be used in any commercially 
available machines that perform PCR, such as any of the GENEAMP® Systems available from Applied 
Biosystems. Also, those of ordinary skill in the art will be able to design oligonucleotide primers based 
upon a ADAMTS2 nucleotide sequence using knowledge available in the art. 

[0092] Also provided is an extension oligonucleotide that hybridizes to the amplified fragment 
adjacent to the polymorphic variation. As used herein, the term "adjacent" refers to the 3' end of the 
extension oligonucleotide being often 1 nucleotide from the 5' end of the polymorphic site, and 
sometimes 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides from the 5' end of the polymorphic site, in the nucleic 
acid when the extension oligonucleotide is hybridized to the nucleic acid. The extension oligonucleotide 
then is extended by one or more nucleotides, and the number and/or type of nucleotides that are added to 
the extension oligonucleotide determine whether the polymorphic variant is present. Oligonucleotide 
extension methods are disclosed, for example, in U.S. Pat. Nos. 4,656,127; 4,851,331; 5,679,524; 
5,834,189; 5,876,934; 5,908,755; 5,912,118; 5,976,802; 5,981,186; 6,004,744; 6,013,431; 6,017,702; 
6,046,005; 6,087,095; 6,210,891; and WO 01/20039. Oligonucleotide extension methods using mass 
spectrometry are described, for example, in U.S. Pat. Nos. 5,547,835; 5,605,798; 5,691,141; 5,849,542; 
5,869,242; 5,928,906; 6,043,031; and 6,194,144, and a method often utilized is described herein in 
Example 2. 

[0093] A microarray can be utilized for determining whether a polymorphic variant is present or 
absent in a nucleic acid sample. A microarray may include any oligonucleotides described herein, and 
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methods for making and using oligonucleotide microarrays suitable for diagnostic use are disclosed in 
U.S. Pat. Nos. 5,492,806; 5,525,464; 5,589,330; 5,695,940; 5,849,483; 6,018,041; 6,045,996; 6,136,541; 
6,142,681; 6,156,501; 6,197,506; 6,223,127; 6,225,625; 6,229,911; 6,239,273; WO 00/52625; WO 
01/25485; and WO 01/29259. The microarray typically comprises a solid support and the 
oligonucleotides may be linked to this solid support by covalent bonds or by non-covalent interactions. 
The oligonucleotides may also be linked to the solid support directly or by a spacer molecule. A 
microarray may comprise one or more oligonucleotides complementary to a polymorphic site set forth 
herein. 

[0094| A kit also may be utilized for determining whether a polymorphic variant is present or absent 
in a nucleic acid sample. A kit often comprises one or more pairs of oligonucleotide primers useful for 
amplifying a fragment of a nucleotide sequence of SEQ ID NO: 1-3 or a substantially identical sequence 
thereof, where the fragment includes a polymorphic site. The kit sometimes comprises a polymerizing 
agent, for example, a thermostable nucleic acid polymerase such as one disclosed in U.S. Pat. Nos. 
4,889,818 or 6,077,664. Also, the kit often comprises an elongation oligonucleotide that hybridizes to a 
ADAMTS2 nucleotide sequence in a nucleic acid sample adjacent to the polymorphic site. Where the kit 
includes an elongation oligonucleotide, it also often comprises chain elongating nucleotides, such as 
dATP, dTTP, dGTP, dCTP, and dITP, including analogs of dATP, dTTP, dGTP, dCTP and dITP, 
provided that such analogs are substrates for a thermostable nucleic acid polymerase and can be 
incorporated into a nucleic acid chain elongated from the extension oligonucleotide. Along with chain 
elongating nucleotides would be one or more chain terminating nucleotides such as ddATP, ddTTP, 
ddGTP, ddCTP, and the like. In an embodiment, the kit comprises one or more oligonucleotide primer 
pairs, a polymerizing agent, chain elongating nucleotides, at least one elongation oligonucleotide, and 
one or more chain terminating nucleotides. Kits optionally include buffers, vials, microtiter plates, and 
instructions for use. 

[0095] An individual identified as being at risk of osteoarthritis may be heterozygous or 
homozygous with respect to the allele associated with a higher risk of osteoarthritis. A subject 
homozygous for an allele associated with an increased risk of osteoarthritis is at a comparatively high 
risk of osteoarthritis, a subject heterozygous for an allele associated with an increased risk of 
osteoarthritis is at a comparatively intermediate risk of osteoarthritis, and a subject homozygous for an 
allele associated with a decreased risk of osteoarthritis is at a comparatively low risk of osteoarthritis. A 
genotype may be assessed for a complementary strand, such that the complementary nucleotide at a 
particular position is detected. 

(0096] Also featured are methods for determining risk of osteoarthritis and/or identifying a subject 
at risk of osteoarthritis by contacting a polypeptide or protein encoded by a ADAMTS2 nucleotide 
sequence from a subject with an antibody that specifically binds to an epitope associated with increased 
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risk of osteoarthritis in the polypeptide (e.g., an epitope comprising a valine at position 245 in an IR1LR1 
polypeptide). 

Applications of Prognostic and Diagnostic Results to Pharmacogenomic Methods 

|0097] Pharmacogenomics is a discipline that involves tailoring a treatment for a subject according 
to the subject's genotype as a particular treatment regimen may exert a differential effect depending upon 
the subject's genotype. For example, based upon the outcome of a prognostic test described herein, a 
clinician or physician may target pertinent information and preventative or therapeutic treatments to a 
subject who would be benefited by the information or treatment and avoid directing such information and 
treatments to a subject who would not be benefited (e.g., the treatment has no therapeutic effect and/or 
the subject experiences adverse side effects). 

[0098] The following is an example of a pharmacogenomic embodiment. A particular treatment 
regimen can exert a differential effect depending upon the subject's genotype. Where a candidate 
therapeutic exhibits a significant interaction with a major allele and a comparatively weak interaction 
with a minor allele (e.g., an order of magnitude or greater difference in the interaction), such a 
therapeutic typically would not be administered to a subject genotyped as being homozygous for the 
minor allele, and sometimes not administered to a subject genotyped as being heterozygous for the minor 
allele. In another example, where a candidate therapeutic is not significantly toxic when administered to 
subjects who are homozygous for a major allele but is comparatively toxic when administered to subjects 
heterozygous or homozygous for a minor allele, the candidate therapeutic is not typically administered to 
subjects who are genotyped as being heterozygous or homozygous with respect to the minor allele. 

[0099] The methods described herein are applicable to pharmacogenomic methods for preventing, 
alleviating or treating osteoarthritis. For example, a nucleic acid sample from an individual may be 
subjected to a prognostic test described herein. Where one or more polymorphic variations associated 
with increased risk of osteoarthritis are identified in a subject, information for preventing or treating 
osteoarthritis and/or one or more osteoarthritis treatment regimens then may be prescribed to that subject. 

10100] In certain embodiments, a treatment or preventative regimen is specifically prescribed and/or 
administered to individuals who will most benefit from it based upon their risk of developing 
osteoarthritis assessed by the methods described herein. Thus, provided are methods for identifying a 
subject predisposed to osteoarthritis and then prescribing a therapeutic or preventative regimen to 
individuals identified as having a predisposition. Thus, certain embodiments are directed to a method for 
reducing osteoarthritis in a subject, which comprises: detecting the presence or absence of a polymorphic 
variant associated with osteoarthritis in a nucleotide sequence in a nucleic acid sample from a subject, 
where the nucleotide sequence comprises a polynucleotide sequence selected from the group consisting 
of: (a) a nucleotide sequence of SEQ ED NO: 1 -3; (b) a nucleotide sequence which encodes a polypeptide 
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consisting of an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-3; (c) a 
nucleotide sequence which encodes a polypeptide that is 90% or more identical to an amino acid 
sequence encoded by a nucleotide sequence of SEQ ID NO: 1-3, or a nucleotide sequence about 90% or 
more identical to a nucleotide sequence of SEQ ID NO: 1-3; and (d) a fragment of a polynucleotide 
sequence of (a), (b), or (c); and prescribing or administering a treatment regimen to a subject from whom 
the sample originated where the presence of a polymorphic variation associated with osteoarthritis is 
detected in the nucleotide sequence. In these methods, predisposition results may be utilized in 
combination with other test results to diagnose osteoarthritis. 

[0101] Certain preventative treatments often are prescribed to subjects having a predisposition to 
osteoarthritis and where the subject is diagnosed with osteoarthritis or is diagnosed as having symptoms 
indicative of an early stage of osteoarthritis. The treatment sometimes is preventative {e.g., is prescribed 
or administered to reduce the probability that osteoarthritis arises or progresses), sometimes is 
therapeutic, and sometimes delays, alleviates or halts the progression of osteoarthritis. Any known 
preventative or therapeutic treatment for alleviating or preventing the occurrence of osteoarthritis is 
prescribed and/or administered. For example, the treatment often is directed to decreasing pain and 
improving joint movement. Examples of OA treatments include exercises to keep joints flexible and 
improve muscle strength. Different medications to control pain, including corticosteroids and 
nonsteroidal anti-inflammatory drugs (NSAIDs, e.g., Voltaren); cyclooxygenase-2 (COX-2) inhibitors 
(e.g., Celebrex, Vioxx, Mobic, and Bextra); monoclonal antibodies (e.g., Remicade); tumor necrosis 
factor inhibitors (e.g., Enbrel); or injections of glucocorticoids, hyaluronic acid or chondrotin sulfate into 
joints that are inflamed and not responsive to NSAIDS. Orally administered chondroitin sulfate also may 
be used as a therapeutic, as it may increase hyaluronic acid levels and viscosity of synovial fluid, and 
decrease collagenase levels in synovial fluid. Also, glucosamine can serve as an OA therapeutic as 
delivering it into joints may inhibit enzymes involved in cartilage degradation and enhance the 
production of hyaluronic acid. For mild pain without inflammation, acetaminophen may be used. Other 
treatments include: heat/cold therapy for temporary pain relief; joint protection to prevent strain or stress 
on painful joints; surgery to relieve chronic pain in damaged joints; and weight control to prevent extra 
stress on weight-bearing joints. 

[0102] As therapeutic approaches for treating osteoarthritis continue to evolve and improve, the goal 
of treatments for osteoarthritis related disorders is to intervene even before clinical signs first manifest. 
Thus, genetic markers associated with susceptibility to osteoarthritis prove useful for early diagnosis, 
prevention and treatment of osteoarthritis. 

[0103] As osteoarthritis preventative and treatment information can be specifically targeted to 
subjects in need thereof (e.g., those at risk of developing osteoarthritis or those in an early stage of 
osteoarthritis), provided herein is a method for preventing or reducing the risk of developing 
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osteoarthritis in a subject, which comprises: (a) detecting the presence or absence of a polymorphic 
variation associated with osteoarthritis at a polymorphic site in a nucleotide sequence in a nucleic acid 
sample from a subject; (b) identifying a subject with a predisposition to osteoarthritis, whereby the 
presence of the polymorphic variation is indicative of a predisposition to osteoarthritis in the subject; and 
(c) if such a predisposition is identified, providing the subject with information about methods or 
products to prevent or reduce osteoarthritis or to delay the onset of osteoarthritis. Also provided is a 
method of targeting information or advertising to a subpopulation of a human population based on the 
subpopulation being genetically predisposed to a disease or condition, which comprises: (a) detecting the 
presence or absence of a polymorphic variation associated with osteoarthritis at a polymorphic site in a 
nucleotide sequence in a nucleic acid sample from a subject; (b) identifying the subpopulation of subjects 
in which the polymorphic variation is associated with osteoarthritis; and (c) providing information only 
to the subpopulation of subjects about a particular product which may be obtained and consumed or 
applied by the subject to help prevent or delay onset of the disease or condition. 

[0104] Pharmacogenomics methods also may be used to analyze and predict a response to 
osteoarthritis treatment or a drug. For example, if pharmacogenomics analysis indicates a likelihood that 
an individual will respond positively to osteoarthritis treatment with a particular drug, the drug may be 
administered to the individual. Conversely, if the analysis indicates that an individual is likely to respond 
negatively to treatment with a particular drug, an alternative course of treatment may be prescribed. A 
negative response may be defined as either the absence of an efficacious response or the presence of toxic 
side effects. The response to a therapeutic treatment can be predicted in a background study in which 
subjects in any of the following populations are genotyped: a population that responds favorably to a 
treatment regimen, a population that does not respond significantly to a treatment regimen, and a 
population that responds adversely to a treatment regimen (e.g., exhibits one or more side effects). These 
populations are provided as examples and other populations and subpopulations may be analyzed. Based 
upon the results of these analyses, a subject is genotyped to predict whether he or she will respond 
favorably to a treatment regimen, not respond significantly to a treatment regimen, or respond adversely 
to a treatment regimen. 

[0105] The tests described herein also are applicable to clinical drug trials. One or more 
polymorphic variants indicative of response to an agent for treating osteoarthritis or to side effects to an 
agent for treating osteoarthritis may be identified using the methods described herein. Thereafter, 
potential participants in clinical trials of such an agent may be screened to identify those individuals most 
likely to respond favorably to the drug and exclude those likely to experience side effects. In that way, 
the effectiveness of drug treatment may be measured in individuals who respond positively to the drug, 
without lowering the measurement as a result of the inclusion of individuals who are unlikely to respond 
positively in the study and without risking undesirable safety problems. 
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[01 06] Thus, another embodiment is a method of selecting an individual for inclusion in a clinical 
trial of a treatment or drug comprising the steps of: (a) obtaining a nucleic acid sample from an 
individual; (b) determining the identity of a polymorphic variation which is associated with a positive 
response to the treatment or the drug, or at least one polymorphic variation which is associated with a 
negative response to the treatment or the drug in the nucleic acid sample, and (c) including the individual 
in the clinical trial if the nucleic acid sample contains said polymorphic variation associated with a 
positive response to the treatment or the drug or if the nucleic acid sample lacks said polymorphic 
variation associated with a negative response to the treatment or the drug. In addition, the methods 
described herein for selecting an individual for inclusion in a clinical trial of a treatment or drug 
encompass methods with any further limitation described in this disclosure, or those following, specified 
alone or in any combination. The polymorphic variation may be in a sequence selected individually or in 
any combination from the group consisting of (i) a nucleotide sequence of SEQ ID NO: 1-3; (ii) a 
nucleotide sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1-3; (lii) a nucleotide sequence which encodes a polypeptide that is 
90% or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-3, 
or a nucleotide sequence about 90% or more identical to a nucleotide sequence of SEQ ID NO: 1-3; and 
(iv) a fragment of a polynucleotide sequence of (i), (ii), or (iii) comprising the polymorphic site. The 
including step (c) optionally comprises administering the drug or the treatment to the individual if the 
nucleic acid sample contains the polymorphic variation associated with a positive response to the 
treatment or the drug and the nucleic acid sample lacks said biallelic marker associated with a negative 
response to the treatment or the drug. 

[0107] Also provided herein is a method of partnering between a diagnostic/prognostic testing 
provider and a provider of a consumable product, which comprises: (a) the diagnostic/prognostic testing 
provider detects the presence or absence of a polymorphic variation associated with osteoarthritis at a 
polymorphic site in a nucleotide sequence in a nucleic acid sample from a subject; (b) the 
diagnostic/prognostic testing provider identifies the subpopulation of subjects in which the polymorphic 
variation is associated with osteoarthritis; (c) the diagnostic/prognostic testing provider forwards 
information to the subpopulation of subjects about a particular product which may be obtained and 
consumed or applied by the subject to help prevent or delay onset of the disease or condition; and (d) the 
provider of a consumable product forwards to the diagnostic test provider a fee every time the 
diagnostic/prognostic test provider forwards information to the subject as set forth in step (c) above. 

Compositions Comprising Osteoarthritis-Directed Molecules 
[0108] Featured herein is a composition comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and one or more molecules specifically directed and targeted to a nucleic acid 
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comprising a ADAMTS2 nucleotide sequence or amino acid sequence. Such directed molecules include, 
but are not limited to, a compound that binds to a ADAMTS2 nucleotide sequence or amino acid sequence 
referenced herein; a RNAi or siRNA molecule having a strand complementary or substantially 
complementary to a ADAMTS2 nucleotide sequence (e.g., hybridizes to a ADAMTS2 nucleotide 
sequence under conditions of high stringency); an antisense nucleic acid complementary or substantially 
complementary to an RNA encoded by a ADAMTS2 nucleotide sequence (e.g., hybridizes to a 
ADAMTS2 nucleotide sequence under conditions of high stringency); a ribozyme that hybridizes to a 
ADAMTS2 nucleotide sequence (e.g., hybridizes to a ADAMTS2 nucleotide sequence under conditions 
of high stringency); a nucleic acid aptamer that specifically binds a polypeptide encoded by ADAMTS2 
nucleotide sequence; and an antibody that specifically binds to a polypeptide encoded by ADAMTS2 
nucleotide sequence or binds to a nucleic acid having such a nucleotide sequence. In specific 
embodiments, the osteoarthritis directed molecule interacts with a nucleic acid or polypeptide variant 
associated with osteoarthritis, such as variants referenced herein. In other embodiments, the 
osteoarthritis directed molecule interacts with a polypeptide involved in a signal pathway of a 
polypeptide encoded by a ADAMTS2 nucleotide sequence, or a nucleic acid comprising such a nucleotide 
sequence. 

[0109] Compositions sometimes include an adjuvant known to stimulate an immune response, and 
in certain embodiments, an adjuvant that stimulates a T-cell lymphocyte response. Adjuvants are known, 
including but not limited to an aluminum adjuvant (e.g., aluminum hydroxide); a cytokine adjuvant or 
adjuvant that stimulates a cytokine response (e.g., interleukin (IL)-12 and/or gamma-interferon 
cytokines); a Freund-type mineral oil adjuvant emulsion (e.g., Freund's complete or incomplete 
adjuvant); a synthetic lipoid compound; a copolymer adjuvant (e.g., TitreMax); a saponin; Quil A; a 
liposome; an oil-in-water emulsion (e.g., an emulsion stabilized by Tween 80 and pluronic 
polyoxyethlene/polyoxypropylene block copolymer (Syntex Adjuvant Formulation); TitreMax; 
detoxified endotoxin (MPL) and mycobacterial cell wall components (TDW, CWS) in 2% squalene (Ribi 
Adjuvant System)); a muramyl dipeptide; an immune-stimulating complex (ISCOM, e.g., an Ag- 
modified saponin/cholesterol micelle that forms stable cage-like structure); an aqueous phase adjuvant 
that does not have a depot effect (e.g., Gerbu adjuvant); a carbohydrate polymer (e.g., AdjuPrime); L- 
tyrosine; a manide-oleate compound (e.g., Montanide); an ethylene-vinyl acetate copolymer (e.g., Elvax 
40W1.2); or lipid A, for example. Such compositions are useful for generating an immune response 
against osteoarthritis directed molecule (e.g., an HLA-binding subsequence within a polypeptide encoded 
by a ADAMTS2 nucleotide sequence). In such methods, a peptide having an amino acid subsequence of a 
polypeptide encoded by a ADAMTS2 nucleotide sequence is delivered to a subject, where the 
subsequence binds to an HLA molecule and induces a CTL lymphocyte response. The peptide 
sometimes is delivered to the subject as an isolated peptide or as a minigene in a plasmid that encodes the 
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peptide. Methods for identifying HLA-bindmg subsequences in such polypeptides are known (see e.g., 
publication WO02/20616 and PCT application US98/01373 for methods of identifying such sequences). 

[0110] The cell may be in a group of cells cultured in vitro or in a tissue maintained in vitro or 
present in an animal in vivo (e.g., a rat, mouse, ape or human). In certain embodiments, a composition 
comprises a component from a cell such as a nucleic acid molecule (e.g., genomic DNA), a protein 
mixture or isolated protein, for example. The aforementioned compositions have utility in diagnostic, 
prognostic and pharmacogenomic methods described previously and in therapeutics described hereafter. 
Certain osteoarthritis directed molecules are described in greater detail below. 

Compounds 

[0111] Compounds can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; peptoid libraries (libraries of molecules having 
the functionalities of peptides, but with a novel, non-peptide backbone which are resistant to enzymatic 
degradation but which nevertheless remain bioactive (see, e.g., Zuckermann et al., J. Med. Chem.37: 
2678-85 (1994)); spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; "one-bead one-compound" library methods; and synthetic library 
methods using affinity chromatography selection. Biological library and peptoid library approaches are 
typically limited to peptide libraries, while the other approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, Anticancer Drug Des. 12: 145, (1997)). 
Examples of methods for synthesizing molecular libraries are described, for example, in DeWitt et al., 
Proc. Natl. Acad. Sci. U.S.A. 90: 6909 (1993); Erb et al., Proc. Natl. Acad. Sci. USA 91: 1 1422 (1994); 
Zuckermann et al., J. Med. Chem. 37: 2678 (1994); Cho et al., Science 261: 1303 (1993); Carrell et al., 
Angew. Chem. Int. Ed. Engl. 33: 2059 (1994); Carell et al., Angew. Chem. Int. Ed. Engl. 33: 2061 
(1994); and in Gallop et al., J. Med. Chem. 37: 1233 (1994). 

[0112] Libraries of compounds may be presented in solution (e.g., Houghten, Biotechniques 13: 
412-421 (1992)), or on beads (Lam, Nature 354: 82-84 (1991)), chips (Fodor, Nature 364: 555-556 
(1993)), bacteria or spores (Ladner, United States Patent No. 5,223,409), plasmids (Cull et al., Proc. Natl. 
Acad. Sci. USA 89: 1865-1869 (1992)) or on phage (Scott and Smith, Science 249: 386-390 (1990); 
Devlin, Science 249: 404-406 (1990); Cwirla et al., Proc. Natl. Acad. Sci. 87: 6378-6382 (1990); Felici, 
J. Mol. Biol. 222: 301-310 (1991); Ladner supra.). 

[0113] A compound sometimes alters expression and sometimes alters activity of a polypeptide 
target and may be a small molecule. Small molecules include, but are not limited to, peptides, 
peptidomimetics (e.g., peptoids), amino acids, amino acid analogs, polynucleotides, polynucleotide 
analogs, nucleotides, nucleotide analogs, organic or inorganic compounds (i.e., including heteroorganic 
and organometallic compounds) having a molecular weight less than about 10,000 grams per mole, 



33 



PATENT 

Docket 524593008900 



organic or inorganic compounds having a molecular weight less than about 5,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 1,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutically acceptable forms of such compounds. 

Antisense Nucleic Acid Molecules. Ribozymes, RNAi, siRNA and Modified Nucleic 
Acid Molecules 

[0114] An "antisense" nucleic acid refers to a nucleotide sequence complementary to a "sense" 
nucleic acid encoding a polypeptide, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. The antisense nucleic acid can be 
complementary to an entire coding strand, or to a portion thereof or a substantially identical sequence 
thereof. In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence (e.g., 5' and 3' untranslated regions in SEQ ED NO: 1). 

[0115] An antisense nucleic acid can be designed such that it is complementary to the entire coding 
region of an mRNA encoded by a nucleotide sequence (e.g., SEQ ID NO: 1), and often the antisense 
nucleic acid is an oligonucleotide antisense to only a portion of a coding or noncoding region of the 
mRNA. For example, the antisense oligonucleotide can be complementary to the region surrounding the 
translation start site of the mRNA, e.g., between the -10 and +10 regions of the target gene nucleotide 
sequence of interest. An antisense oligonucleotide can be, for example, about 7, 10, 15, 20, 25, 30, 35, 
40, 45, 50, 55, 60, 65, 70, 75, 80, or more nucleotides in length. The antisense nucleic acids, which 
include the ribozymes described hereafter, can be designed to target a ADAMTS2 nucleotide sequence, 
often a variant associated with osteoarthritis, or a substantially identical sequence thereof. Among the 
variants, minor alleles and major alleles can be targeted, and those associated with a higher risk of 
osteoarthritis are often designed, tested, and administered to subjects. 

[0116] An antisense nucleic acid can be constructed using chemical synthesis and enzymatic ligation 
reactions using standard procedures. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used. Antisense nucleic acid 
also can be produced biologically using an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the following subsection). 

[0117] When utilized as therapeutics, antisense nucleic acids typically are administered to a subject 
(e.g., by direct injection at a tissue site) or generated in situ such that they hybridize with or bind to 
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cellular mRNA and/or genomic DNA encoding a polypeptide and thereby inhibit expression of the 
polypeptide, for example, by inhibiting transcription and/or translation. Alternatively, antisense nucleic 
acid molecules can be modified to target selected cells and then are administered systemically. For 
systemic administration, antisense molecules can be modified such that they specifically bind to receptors 
or antigens expressed on a selected cell surface, for example, by linking antisense nucleic acid molecules 
to peptides or antibodies which bind to cell surface receptors or antigens. Antisense nucleic acid 
molecules can also be delivered to cells using the vectors described herein. Sufficient intracellular 
concentrations of antisense molecules are achieved by incorporating a strong promoter, such as a pol II or 
pol III promoter, in the vector construct. 

[0118] Antisense nucleic acid molecules sometimes are alpha-anomeric nucleic acid molecules. An 
alpha-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA 
in which, contrary to the usual beta-units, the strands run parallel to each other (Gaultier et al., Nucleic 
Acids. Res. 15: 6625-6641 (1987)). Antisense nucleic acid molecules can also comprise a 2'-o- 
methylribonucleotide (Inoue et al., Nucleic Acids Res. 15: 6131-6148 (1987)) or a chimeric RNA-DNA 
analogue (Inoue et al., FEBS Lett. 215: 327-330 (1987)). Antisense nucleic acids sometimes are 
composed of DNA or PNA or any other nucleic acid derivatives described previously. 

[0119] In another embodiment, an antisense nucleic acid is a ribozyme. A ribozyme having 
specificity for a ADAMTS2 nucleotide sequence can include one or more sequences complementary to 
such a nucleotide sequence, and a sequence having a known catalytic region responsible for mRNA 
cleavage (see e.g., U.S. Pat. No. 5,093,246 or Haselhoff and Gerlach, Nature 334: 585-591 (1988)). For 
example, a derivative of a Tetrahymena L-19 rVS RNA is sometimes utilized in which the nucleotide 
sequence of the active site is complementary to the nucleotide sequence to be cleaved in a mRNA (see 
e.g., Cech et al. U.S. Patent No. 4,987,071 ; and Cech et al. U.S. Patent No. 5,1 16,742). Also, target 
mRNA sequences can be used to select a catalytic RNA having a specific ribonuclease activity from a 
pool of RNA molecules (see e.g., Bartel & Szostak, Science 261: 141 1-1418 (1993)). 

[0120] Osteoarthritis directed molecules include in certain embodiments nucleic acids that can form 
triple helix structures with a ADAMTS2 nucleotide sequence, or a substantially identical sequence 
thereof, especially one that includes a regulatory region that controls expression of a polypeptide. Gene 
expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region of 
a nucleotide sequence referenced herein or a substantially identical sequence (e.g., promoter and/or 
enhancers) to form triple helical structures that prevent transcription of a gene in target cells (see e.g., 
Helene, Anticancer Drug Des. 6(6): 569-84 (1991); Helene et al., Ann. N.Y. Acad. Sci. 660: 27-36 
(1992); and Maher, Bioassays 14(12): 807-15 (1992). Potential sequences that can be targeted for triple 
helix formation can be increased by creating a so-called "switchback" nucleic acid molecule. Switchback 
molecules are synthesized in an alternating 5 '-3', 3 '-5' manner, such that they base pair with first one 
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strand of a duplex and then the other, eliminating the necessity for a sizeable stretch of either purines or 
pyrimidines to be present on one strand of a duplex. 

|0121] Osteoarthritis directed molecules include RNAi and siRNA nucleic acids. Gene expression 
may be inhibited by the introduction of double-stranded RNA (dsRNA), which induces potent and 
specific gene silencing, a phenomenon called RNA interference or RNAi. See, e.g., Fire et al., US Patent 
Number 6,506,559; Tuschl et al. PCT International Publication No. WO 01/75164; Kay et al. PCT 
International Publication No. WO 03/010180A1; or Bosher JM, Labouesse, Nat Cell Biol 2000 
Feb;2(2):E31-6. This process has been improved by decreasing the size of the double-stranded RNA to 
20-24 base pairs (to create small-interfering RNAs or siRNAs) that "switched off genes in mammalian 
cells without initiating an acute phase response, i.e., a host defense mechanism that often results in cell 
death (see, e.g., Caplen et al. Proc Natl Acad Sci USA. 2001 Aug 14;98(17):9742-7 and Elbashir et al. 
Methods 2002 Feb;26(2): 199-213). There is increasing evidence of post-transcriptional gene silencing 
by RNA interference (RNAi) for inhibiting targeted expression in mammalian cells at the mRNA level, 
in human cells. There is additional evidence of effective methods for inhibiting the proliferation and 
migration of tumor cells in human patients, and for inhibiting metastatic cancer development (see, e.g., 
U.S. Patent Application No. US2001000993183; Caplen et al. Proc Natl Acad Sci USA; and 
Abderrahmani et al. Mol Cell Biol 2001 Nov21(21):7256-67). 

[0122] An "siRNA" or "RNAi" refers to a nucleic acid that forms a double stranded RNA and has 
the ability to reduce or inhibit expression of a gene or target gene when the siRNA is delivered to or 
expressed in the same cell as the gene or target gene. "siRNA" refers to short double-stranded RNA 
formed by the complementary strands. Complementary portions of the siRNA that hybridize to form the 
double stranded molecule often have substantial or complete identity to the target molecule sequence. In 
one embodiment, an siRNA refers to a nucleic acid that has substantial or complete identity to a target 
gene and forms a double stranded siRNA. 

[0123] When designing the siRNA molecules, the targeted region often is selected from a given 
DNA sequence beginning 50 to 100 nucleotides downstream of the start codon. See, e.g., Elbashir et al,. 
Methods 26:199-213 (2002). Initially, 5' or 3' UTRs and regions nearby the start codon were avoided 
assuming that UTR-binding proteins and/or translation initiation complexes may interfere with binding of 
the siRNP or RISC endonuclease complex. Sometimes regions of the target 23 nucleotides in length 
conforming to the sequence motif AA(N19)TT (N, an nucleotide), and regions with approximately 30% 
to 70% G/C-content (often about 50% G/C-content) often are selected. If no suitable sequences are 
found, the search often is extended using the motif NA(N21). The sequence of the sense siRNA 
sometimes corresponds to (N19) TT or N21 (position 3 to 23 of the 23-nt motif), respectively. In the 
latter case, the 3' end of the sense siRNA often is converted to TT. The rationale for this sequence 
conversion is to generate a symmetric duplex with respect to the sequence composition of the sense and 
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antisense 3' overhangs. The antisense siRNA is synthesized as the complement to position 1 to 21 of the 
23-nt motif. Because position 1 of the 23-nt motif is not recognized sequence-specifically by the 
antisense siRNA, the 3 '-most nucleotide residue of the antisense siRNA can be chosen deliberately. 
However, the penultimate nucleotide of the antisense siRNA (complementary to position 2 of the 23-nt 
motif) often is complementary to the targeted sequence. For simplifying chemical synthesis, TT often is 
utilized. siRNAs corresponding to the target motif NARfN 1 7)YNN, where R is purine (A,G) and Y is 
pyrimidine (C,U), often are selected. Respective 21 nucleotide sense and antisense siRNAs often begin 
with a purine nucleotide and can also be expressed from pol III expression vectors without a change in 
targeting site. Expression of RNAs from pol III promoters often is efficient when the first transcribed 
nucleotide is a purine. 

[0124] The sequence of the siRNA can correspond to the full length target gene, or a subsequence 
thereof. Often, the siRNA is about 15 to about 50 nucleotides in length (e.g., each complementary 
sequence of the double stranded siRNA is 15-50 nucleotides in length, and the double stranded siRNA is 
about 15-50 base pairs in length, sometimes about 20-30 nucleotides in length or about 20-25 nucleotides 
in length, e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length. The siRNA sometimes 
is about 21 nucleotides in length. Methods of using siRNA are well known in the art, and specific siRNA 
molecules may be purchased from a number of companies including Dharmacon Research, Inc. 

[0125] Antisense, ribozyme, RNAi and siRNA nucleic acids can be altered to form modified nucleic 
acid molecules. The nucleic acids can be altered at base moieties, sugar moieties or phosphate backbone 
moieties to improve stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of nucleic acid molecules can be modified to generate peptide nucleic acids (see 
Hyrup et al., Bioorganic & Medicinal Chemistry 4 (1): 5-23 (1996)). As used herein, the terms "peptide 
nucleic acid" or "PNA" refers to a nucleic acid mimic such as a DNA mimic, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are 
retained. The neutral backbone of a PNA can allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. Synthesis of PNA oligomers can be performed using standard solid 
phase peptide synthesis protocols as described, for example, in Hyrup et al., (1996) supra and Perry- 
O'Keefe et al., Proc. Natl. Acad. Sci. 93: 14670-675 (1996). 

[0126] PNA nucleic acids can be used in prognostic, diagnostic, and therapeutic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, for example, inducing transcription or translation arrest or inhibiting replication. PNA 
nucleic acid molecules can also be used in the analysis of single base pair mutations in a gene, (e.g., by 
PNA-directed PCR clamping); as "artificial restriction enzymes" when used in combination with other 
enzymes, (e.g., SI nucleases (Hyrup (1996) supra)); or as probes or primers for DNA sequencing or 
hybridization (Hyrup et al., (1996) supra; Perry-O'Keefe supra). 
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[0127] In other embodiments, oligonucleotides may include other appended groups such as peptides 
(e.g., for targeting host cell receptors in vivo), or agents facilitating transport across cell membranes (see 
e.g., Letsinger et al., Proc. Natl. Acad. Sci. USA 86: 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. 
Sci. USA 84: 648-652 (1987); PCT Publication No. W088/09810) or the blood-brain barrier (see, e.g., 
PCT Publication No. W089/10134). In addition, oligonucleotides can be modified with hybridization- 
triggered cleavage agents (See, e.g., Krol et al., Bio-Techniques 6: 958-976 (1988)) or intercalating 
agents. (See, e.g., Zon, Pharm. Res. 5: 539-549 (1988) ). To this end, the oligonucleotide may be 
conjugated to another molecule, (e.g., a peptide, hybridization triggered cross-linking agent, transport 
agent, or hybridization-triggered cleavage agent). 

[0128] Also included herein are molecular beacon oligonucleotide primer and probe molecules 
having one or more regions complementary to a ADAMTS2 nucleotide sequence, or a substantially 
identical sequence thereof, two complementary regions one having a fluorophore and one a quencher 
such that the molecular beacon is useful for quantifying the presence of the nucleic acid in a sample. 
Molecular beacon nucleic acids are described, for example, in Lizardi et al., U.S. Patent No. 5,854,033; 
Nazarenko et al., U.S. Patent No. 5,866,336, and Livak et al., U.S. Patent 5,876,930. 

Antibodies 

[0129] The term "antibody" as used herein refers to an immunoglobulin molecule or 
immunologically active portion thereof, i.e., an antigen-binding portion. Examples of immunologically 
active portions of immunoglobulin molecules include F(ab) and F(ab') 2 fragments which can be 
generated by treating the antibody with an enzyme such as pepsin. An antibody sometimes is a 
polyclonal, monoclonal, recombinant (e.g., a chimeric or humanized), fully human, non-human (e.g., 
murine), or a single chain antibody. An antibody may have effector function and can fix complement, 
and is sometimes coupled to a toxin or imaging agent. 

[0130] A full-length polypeptide or antigenic peptide fragment encoded by a nucleotide sequence 
referenced herein can be used as an immunogen or can be used to identify antibodies made with other 
immunogens, e.g., cells, membrane preparations, and the like. An antigenic peptide often includes at 
least 8 amino acid residues of the ammo acid sequences encoded by a nucleotide sequence referenced 
herein, or substantially identical sequence thereof, and encompasses an epitope. Antigenic peptides 
sometimes include 10 or more amino acids, 15 or more amino acids, 20 or more amino acids, or 30 or 
more amino acids. Hydrophilic and hydrophobic fragments of polypeptides sometimes are used as 
immunogens. 

[0131] Epitopes encompassed by the antigenic peptide are regions located on the surface of the 
polypeptide (e.g., hydrophilic regions) as well as regions with high antigenicity. For example, an Emini 
surface probability analysis of the human polypeptide sequence can be used to indicate the regions that 
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have a particularly high probability of being localized to the surface of the polypeptide and are thus likely 
to constitute surface residues useful for targeting antibody production. The antibody may bind an epitope 
on any domain or region on polypeptides described herein. 

|0132] Also, chimeric, humanized, and completely human antibodies are useful for applications 
which include repeated administration to subjects. Chimeric and humanized monoclonal antibodies, 
comprising both human and non-human portions, can be made using standard recombinant DNA 
techniques. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in Robinson et al International 
Application No. PCT/US86/02269; Akira, et al European Patent Application 184,187; Taniguchi, M., 
European Patent Application 171,496; Morrison et al European Patent Application 173,494; Neuberger et 
al PCT International Publication No. WO 86/01533; Cabilly et al U.S. Patent No. 4,816,567; Cabilly et al 
European Patent Application 125,023; Better et al., Science 240: 1041-1043 (1988); Liu et al., Proc. Natl. 
Acad. Sci. USA 84: 3439-3443 (1987); Liu et al., J. Immunol. 139: 3521-3526 (1987); Sun et al., Proc. 
Natl. Acad. Sci. USA 84: 214-218 (1987); Nishimura et al., Cane. Res. 47: 999-1005 (1987); Wood et al., 
Nature 314: 446-449 (1985); and Shaw et al., J. Natl. Cancer Inst. 80: 1553-1559 (1988); Morrison, S. 
L., Science 229: 1202-1207 (1985); Oi et al., BioTechniques 4: 214 (1986); Winter U.S. Patent 
5,225,539; Jones et al., Nature 321 : 552-525 (1986); Verhoeyan et al., Science 239: 1534; and Beidler et 
al., J. Immunol. 141: 4053-4060 (1988). 

|0133] Completely human antibodies are particularly desirable for therapeutic treatment of human 
patients. Such antibodies can be produced using transgenic mice that are incapable of expressing 
endogenous immunoglobulin heavy and light chains genes, but which can express human heavy and light 
chain genes. See, for example, Lonberg and Huszar, Int. Rev. Immunol. 13: 65-93 (1995); and U.S. 
Patent Nos. 5,625,126; 5,633,425; 5,569,825; 5,661,016; and 5,545,806. In addition, companies such as 
Abgenix, Inc. (Fremont, CA) and Medarex, Inc. (Princeton, NJ), can be engaged to provide human 
antibodies directed against a selected antigen using technology similar to that described above. 
Completely human antibodies that recognize a selected epitope also can be generated using a technique 
referred to as "guided selection." In this approach a selected non-human monoclonal antibody (e.g., a 
murine antibody) is used to guide the selection of a completely human antibody recognizing the same 
epitope. This technology is described for example by Jespers et al., Bio/Technology 12: 899-903 (1994). 

[0134] An antibody can be a single chain antibody. A single chain antibody (scFV) can be 
engineered (see, e.g., Colcher et al., Ann. N Y Acad. Sci. 880: 263-80 (1999); and Reiter, Clin. Cancer 
Res. 2: 245-52 (1996)). Single chain antibodies can be dimerized or multimerized to generate 
multivalent antibodies having specificities for different epitopes of the same target polypeptide. 

[0135] Antibodies also may be selected or modified so that they exhibit reduced or no ability to bind 
an Fc receptor. For example, an antibody may be an isotype or subtype, fragment or other mutant, which 
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does not support binding to an Fc receptor (e.g., it has a mutagenized or deleted Fc receptor binding 
region). 

[0136] Also, an antibody (or fragment thereof) may be conjugated to a therapeutic moiety such as a 
cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or cytotoxic agent includes any 
agent that is detrimental to cells. Examples include taxol, cytochalasin B, gramicidin D, ethidium 
bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, 
daunorubicin, dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1 
dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and 
analogs or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites (e.g., 
methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents 
(e.g., mechlorethamine, thiotepa chlorambucil, melphalan, carmustine (BCNU) and lomustine (CCNU), 
cyclophosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine 
platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 
doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, mithramycin, and 
anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and vinblastine). 

[0137] Antibody conjugates can be used for modifying a given biological response. For example, 
the drug moiety may be a protein or polypeptide possessing a desired biological activity. Such proteins 
may include, for example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a 
polypeptide such as tumor necrosis factor, gamma-interferon, alpha -interferon, nerve growth factor, 
platelet derived growth factor, tissue plasminogen activator; or, biological response modifiers such as, for 
example, lymphokines, interleukin-1 ("fL-l"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte 
macrophage colony stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or 
other growth factors. Also, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980, for example. 

[0138] An antibody (e.g., monoclonal antibody) can be used to isolate target polypeptides by 
standard techniques, such as affinity chromatography or immunoprecipitation. Moreover, an antibody 
can be used to detect a target polypeptide (e.g., in a cellular lysate or cell supernatant) in order to evaluate 
the abundance and pattern of expression of the polypeptide. Antibodies can be used diagnostically to 
monitor polypeptide levels in tissue as part of a clinical testing procedure, e.g., to determine the efficacy 
of a given treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) the 
antibody to a detectable substance (i.e., antibody labeling). Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, /3-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
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include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; 
examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include ,2s I, 131 1, 3:> S or 3 H. Also, an antibody can be utilized as a test 
molecule for determining whether it can treat osteoarthritis, and as a therapeutic for administration to a 
subject for treating osteoarthritis. 

[0139] An antibody can be made by immunizing with a purified antigen, or a fragment thereof, e.g., 
a fragment described herein, a membrane associated antigen, tissues, e.g., crude tissue preparations, 
whole cells, preferably living cells, lysed cells, or cell fractions. 

[0140] Included herein are antibodies which bind only a native polypeptide, only denatured or 
otherwise non-native polypeptide, or which bind both, as well as those having linear or conformational 
epitopes. Conformational epitopes sometimes can be identified by selecting antibodies that bind to 
native but not denatured polypeptide. Also featured are antibodies that specifically bind to a polypeptide 
variant associated with osteoarthritis. 

Methods for Identifying Candidate Therapeutics for Treating Osteoarthritis 
[0141] Current therapies for the treatment of osteoarthritis have limited efficacy, limited tolerability 
and significant mechanism-based side effects, and few of the available therapies adequately address 
underlying defects. Current therapeutic approaches were largely developed in the absence of defined 
molecular targets or even a solid understanding of disease pathogenesis. Therefore, provided are 
methods of identifying candidate therapeutics that target biochemical pathways related to the 
development of osteoarthritis. 

[0142] Thus, featured herein are methods for identifying a candidate therapeutic for treating 
osteoarthritis. The methods comprise contacting a test molecule with a target molecule in a system. A 
"target molecule" as used herein refers to a ADAMTS2 nucleic acid, a substantially identical nucleic acid 
thereof, or a fragment thereof, and an encoded polypeptide of the foregoing. The methods also comprise 
determining the presence or absence of an interaction between the test molecule and the target molecule, 
where the presence of an interaction between the test molecule and the nucleic acid or polypeptide 
identifies the test molecule as a candidate osteoarthritis therapeutic. The interaction between the test 
molecule and the target molecule may be quantified. 

[0143] Test molecules and candidate therapeutics include, but are not limited to, compounds, 
antisense nucleic acids, siRNA molecules, ribozymes, polypeptides or proteins encoded by a ADAMTS2 
nucleotide sequence, or a substantially identical sequence or fragment thereof, and immunotherapeutics 
(e.g., antibodies and HLA-presented polypeptide fragments). A test molecule or candidate therapeutic 
may act as a modulator of target molecule concentration or target molecule function in a system. A 
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"modulator" may agonize (i.e., up-regulates) or antagonize (i.e., down-regulates) a target molecule 
concentration partially or completely in a system by affecting such cellular functions as DNA replication 
and/or DNA processing (e.g., DNA methylation or DNA repair), RNA transcription and/or RNA 
processing (e.g., removal of intronic sequences and/or translocation of spliced mRNA from the nucleus), 
polypeptide production (e.g., translation of the polypeptide from mRNA), and/or polypeptide post- 
translational modification (e.g., glycosylation, phosphorylation, and proteolysis of pro-polypeptides). A 
modulator may also agonize or antagonize a biological function of a target molecule partially or 
completely, where the function may include adopting a certain structural conformation, interacting with 
one or more binding partners, ligand binding, catalysis (e.g., phosphorylation, dephosphorylation, 
hydrolysis, methylation, and isomerization), and an effect upon a cellular event (e.g., effecting 
progression of osteoarthritis). 

[0144] As used herein, the term "system" refers to a cell free in vitro environment and a cell-based 
environment such as a collection of cells, a tissue, an organ, or an organism. A system is "contacted" 
with a test molecule in a variety of manners, including adding molecules in solution and allowing them to 
interact with one another by diffusion, cell injection, and any administration routes in an animal. As used 
herein, the term "interaction" refers to an effect of a test molecule on test molecule, where the effect 
sometimes is binding between the test molecule and the target molecule, and sometimes is an observable 
change in cells, tissue, or organism. 

[0145] There are many standard methods for detecting the presence or absence of interaction 
between a test molecule and a target molecule. For example, titrametric, acidimetric, radiometric, NMR, 
monolayer, polarographic, spectrophotometric, fluorescent, and ESR assays probative of a target 
molecule interaction may be utilized. 

[0146] ADAMTS2 activity and/or ADAMTS2 interactions can be detected and quantified using 
assays known in the art. For example, an immunoprecipitation assay or a kinase activity assay that 
employs a kinase-inactivated MEK can be utilized. Kinase inactivated MEKs are known in the art, such 
as a MEK that includes the mutation K97M. In these assays, mammalian cells (e.g., COS or NIH-3T3) 
are transiently transfected with constructs expressing ADAMTS2, and in addition, the cells are co- 
transfected with oncogenic RAS or SRC or both. Oncogenic RAS or SRC activates ADAMTS2 kinase 
activity. ADAMTS2 is immunoprecipitated from cell extracts using a monoclonal antibody (e.g., 9E10) 
or a polyclonal antibody (e.g., from rabbit) specific for a unique peptide from ADAMTS2. ADAMTS2 is 
then resuspended in assay buffer containing GST-Mekl or GST-Mek2 and/or GST-ERK2. In addition, 
[gamma 32 P] ATP can be added to detect and/or quantify phosphorylation activity. Samples are 
incubated for 5-30 minutes at 30°C, and then the reaction is terminated by addition of EDTA. The 
samples are centnfuged and the supernatant fractions are collected. Phosphorylation activity is detected 
using one of two methods: (i) activity of GST-ERK2 kinase can be measured using MBP (myelin basic 
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protein, a substrate for ERK) as substrate, or (ii) following incubation of immunoprecipitated ADAMTS2 
in reaction buffer containing GST-ERK and [gamma 32 P] ATP, transfer of labeled ATP to kinase-dead 
ERK can be quantified by a phosphor-imager or densitometer following PAGE separation of polypeptide 
products (phosphorylated and non-phosphorylated forms). These types of assays are described in Weber 
et al., Oncogene 19: 169-176 (2000); Mason et al , EMBO J. 18:2137-2148(1999); Marais et al., J. Biol. 
Chem. 272: 4378-4383 (1997); Marais et al., EMBO J. 14: 3136-3145 (1995). 

[0147] As noted above, ADAMTS2 includes a domain having metal loprotease activity, and 
modulators of such activity are known. Examples of such modulators are set forth in WO03063762A2; 
WO-09937625; WO-09918076; WO-09838163; WO-09837877; WO9947550A1; WO0177092A1; 
WO0040577A1; W09942436A1; W09838163A1; W09837877A1; WO04014379A1; WO03106381A2; 
WO03014098A1; WO03014092A1 and WO02096426A1. 

[0148] Test molecule/target molecule interactions can be detected and/or quantified using assays 
known in the art. For example, an interaction can be determined by labeling the test molecule and/or the 
target molecule, where the label is covalently or non-covalently attached to the test molecule or target 
molecule. The label is sometimes a radioactive molecule such as 125 I, U, I, 3 'S or 3 H, which can be 
detected by direct counting of radioemission or by scintillation counting. Also, enzymatic labels such as 
horseradish peroxidase, alkaline phosphatase, or luciferase may be utilized where the enzymatic label can 
be detected by determining conversion of an appropriate substrate to product. In addition, presence or 
absence of an interaction can be determined without labeling. For example, a microphysiometer {e.g. , 
Cytosensor) is an analytical instrument that measures the rate at which a cell acidifies its environment 
using a light-addressable potentiometnc sensor (LAPS). Changes in this acidification rate can be used as 
an indication of an interaction between a test molecule and target molecule (McConnell, H. M. et al, 
Science 257: 1906-1912 (1992)). 

[0149] In cell-based systems, cells typically include a ADAMTS2 nucleic acid, an encoded 
polypeptide, or substantially identical nucleic acid or polypeptide thereof, and are often of mammalian 
origin, although the cell can be of any origin. Whole cells, cell homogenates, and cell fractions (e.g., cell 
membrane fractions) can be subjected to analysis. Where interactions between a test molecule with a 
target polypeptide are monitored, soluble and/or membrane bound forms of the polypeptide may be 
utilized. Where membrane-bound forms of the polypeptide are used, it may be desirable to utilize a 
solubilizing agent. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 
methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, Isotridecypoly(ethylene glycol ether) n , 3- 
[(3-cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 
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cholamidopropyl)dimethylamminio]-2-hydroxy-l -propane sulfonate (CHAPSO), or N-dodecyl-N,N- 
dimethyl-3-ammonio-l -propane sulfonate. 

[0150] An interaction between a test molecule and target molecule also can be detected by 
monitoring fluorescence energy transfer (FET) {see, e.g., Lakowicz et al, U.S. Patent No. 5,631,169; 
Stavrianopoulos et al. U.S. Patent No. 4,868,103). A fluorophore label on a first, "donor" molecule is 
selected such that its emitted fluorescent energy will be absorbed by a fluorescent label on a second, 
"acceptor" molecule, which in turn is able to fluoresce due to the absorbed energy. Alternately, the 
"donor" polypeptide molecule may simply utilize the natural fluorescent energy of tryptophan residues. 
Labels are chosen that emit different wavelengths of light, such that the "acceptor" molecule label may 
be differentiated from that of the "donor". Since the efficiency of energy transfer between the labels is 
related to the distance separating the molecules, the spatial relationship between the molecules can be 
assessed. In a situation in which binding occurs between the molecules, the fluorescent emission of the 
"acceptor" molecule label in the assay should be maximal. An FET binding event can be conveniently 
measured through standard fluorometric detection means well known in the art {e.g., using a fluorimeter). 

[0151] In another embodiment, determining the presence or absence of an interaction between a test 
molecule and a target molecule can be effected by monitoring surface plasmon resonance {see, e.g., 
Sjolander & Urbaniczk, Anal. Chem. 63: 2338-2345 (1991) and Szabo et al, Curr. Opin. Struct. Biol. 5: 
699-705 (1995)). "Surface plasmon resonance" or "biomolecular interaction analysis (BIA)" can be 
utilized to detect biospecific interactions in real time, without labeling any of the interactants {e.g., 
BIAcore). Changes in the mass at the binding surface (indicative of a binding event) result in alterations 
of the refractive index of light near the surface (the optical phenomenon of surface plasmon resonance 
(SPR)), resulting in a detectable signal which can be used as an indication of real-time reactions between 
biological molecules. 

[0152] In another embodiment, the target molecule or test molecules are anchored to a solid phase, 
facilitating the detection of target molecule/test molecule complexes and separation of the complexes 
from free, uncomplexed molecules. The target molecule or test molecule is immobilized to the solid 
support. In an embodiment, the target molecule is anchored to a solid surface, and the test molecule, 
which is not anchored, can be labeled, either directly or indirectly, with detectable labels discussed 
herein. 

[0153] It may be desirable to immobilize a target molecule, an anti-target molecule antibody, and/or 
test molecules to facilitate separation of target molecule/test molecule complexes from uncomplexed 
forms, as well as to accommodate automation of the assay. The attachment between a test molecule 
and/or target molecule and the solid support may be covalent or non-covalent {see, e.g., U.S. Patent No. 
6,022,688 for non-covalent attachments). The solid support may be one or more surfaces of the system, 
such as one or more surfaces in each well of a microtiter plate, a surface of a silicon wafer, a surface of a 
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bead {see, e.g., Lam, Nature 354: 82-84 (1991)) that is optionally linked to another solid support, or a 
channel in a microfluidic device, for example. Types of solid supports, linker molecules for covalent and 
non-covalent attachments to solid supports, and methods for immobilizing nucleic acids and other 
molecules to solid supports are well known (see, e.g., U.S. Patent Nos. 6,261,776; 5,900,481; 6,133,436; 
and 6,022,688; and WIPO publication WO 01/18234). 

|0154] In an embodiment, target molecule may be immobilized to surfaces via biotin and 
streptavidin. For example, biotinylated target polypeptide can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), 
and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In another 
embodiment, a target polypeptide can be prepared as a fusion polypeptide. For example, glutathione-S- 
transferase/target polypeptide fusion can be adsorbed onto glutathione sepharose beads (Sigma Chemical, 
St. Louis, MO) or glutathione derivitized microtiter plates, which are then combined with a test molecule 
under conditions conducive to complex formation (e.g., at physiological conditions for salt and pH). 
Following incubation, the beads or microtiter plate wells are washed to remove any unbound 
components, or the matrix is immobilized in the case of beads, and complex formation is determined 
directly or indirectly as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of target molecule binding or activity is determined using standard techniques. 

[0155] In an embodiment, the non-immobilized component is added to the coated surface containing 
the anchored component. After the reaction is complete, unreacted components are removed (e.g., by 
washing) under conditions such that a significant percentage of complexes formed will remain 
immobilized to the solid surface. The detection of complexes anchored on the solid surface can be 
accomplished in a number of manners. Where the previously non-immobilized component is pre-labeled, 
the detection of label immobilized on the surface indicates that complexes were formed. Where the 
previously non-immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface, e.g., by adding a labeled antibody specific for the immobilized 
component, where the antibody, in turn, can be directly labeled or indirectly labeled with, e.g., a labeled 
anti-Ig antibody. 

[0156] In another embodiment, an assay is performed utilizing antibodies that specifically bind 
target molecule or test molecule but do not interfere with binding of the target molecule to the test 
molecule. Such antibodies can be derivitized to a solid support, and unbound target molecule may be 
immobilized by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of complexes using 
antibodies reactive with the target molecule, as well as enzyme -linked assays which rely on detecting an 
enzymatic activity associated with the target molecule. 
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[0157] Cell free assays also can be conducted in a liquid phase. In such an assay, reaction products 
are separated from unreacted components, by any of a number of standard techniques, including but not 
limited to: differential centrifugation (see, e.g., Rivas, G., and Minton, Trends Biochem Sci Aug; 18(8): 
284-7 (1993)); chromatography (gel filtration chromatography, ion-exchange chromatography); 
electrophoresis (see, e.g., Ausubel el al., eds. Current Protocols in Molecular Biology , J. Wiley: New 
York (1999)); and immunoprecipitation (see, e.g., Ausubel et al., eds., supra). Media and 
chromatographic techniques are known to one skilled in the art (see, e.g., Heegaard, J Mol. Recognit. 
Winter; 11(1-6): 141-8 (1998); Hage & Tweed, J. Chromatogr. B Biomed. Sci. Appl. Oct 10; 699 (1-2): 
499-525 (1997)). Further, fluorescence energy transfer may also be conveniently utilized, as described 
herein, to detect binding without further purification of the complex from solution. 

[0158] In another embodiment, modulators of target molecule expression are identified. For 
example, a cell or cell free mixture is contacted with a candidate compound and the expression of target 
mRNA or target polypeptide is evaluated relative to the level of expression of target mRNA or target 
polypeptide in the absence of the candidate compound. When expression of target mRNA or target 
polypeptide is greater in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an agonist of target mRNA or target polypeptide expression. Alternatively, 
when expression of target mRNA or target polypeptide is less (e.g. , less with statistical significance) in 
the presence of the candidate compound than in its absence, the candidate compound is identified as an 
antagonist or inhibitor of target mRNA or target polypeptide expression. The level of target mRNA or 
target polypeptide expression can be determined by methods described herein. 

[0159] In another embodiment, binding partners that interact with a target molecule are detected. 
The target molecules can interact with one or more cellular or extracellular macromolecules, such as 
polypeptides in vivo, and these interacting molecules are referred to herein as "binding partners." 
Binding partners can agonize or antagonize target molecule biological activity. Also, test molecules that 
agonize or antagonize interactions between target molecules and binding partners can be useful as 
therapeutic molecules as they can up-regulate or down-regulated target molecule activity in vivo and 
thereby treat osteoarthritis. 

[0160] Binding partners of target molecules can be identified by methods known in the art. For 
example, binding partners may be identified by lysing cells and analyzing cell lysates by electrophoretic 
techniques. Alternatively, a two-hybrid assay or three-hybrid assay can be utilized (see, e.g., U.S. Patent 
No. 5,283,317; Zervos et al., Cell 72:223-232 (1993); Madura et al, J. Biol. Chem. 268: 12046-12054 
(1993); Battel et al., Biotechniques 14: 920-924 (1993); Iwabuchi et al., Oncogene 8: 1693-1696 (1993); 
and Brent WO94/10300). A two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. The assay often utilizes two 
different DNA constructs. In one construct, a ADAMTS2 nucleic acid (sometimes referred to as the 
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"bait") is fused to a gene encoding the DNA binding domain of a known transcription factor (e.g., GAL- 
4). In another construct, a DNA sequence from a library of DNA sequences that encodes a potential 
binding partner (sometimes referred to as the "prey") is fused to a gene that encodes an activation domain 
of the known transcription factor. Sometimes, a ADAMTS2 nucleic acid can be fused to the activation 
domain. If the "bait" and the "prey" molecules interact in vivo, the DNA-binding and activation domains 
of the transcription factor are brought into close proximity. This proximity allows transcription of a 
reporter gene (e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be detected and cell colonies containing the 
functional transcription factor can be isolated and used to identify the potential binding partner. 

[0161] In an embodiment for identifying test molecules that antagonize or agonize complex 
formation between target molecules and binding partners, a reaction mixture containing the target 
molecule and the binding partner is prepared, under conditions and for a time sufficient to allow complex 
formation. The reaction mixture often is provided in the presence or absence of the test molecule. The 
test molecule can be included initially in the reaction mixture, or can be added at a time subsequent to the 
addition of the target molecule and its binding partner. Control reaction mixtures are incubated without 
the test molecule or with a placebo. Formation of any complexes between the target molecule and the 
binding partner then is detected. Decreased formation of a complex in the reaction mixture containing 
test molecule as compared to in a control reaction mixture indicates that the molecule antagonizes target 
molecule/binding partner complex formation. Alternatively, increased formation of a complex in the 
reaction mixture containing test molecule as compared to in a control reaction mixture indicates that the 
molecule agonizes target molecule/binding partner complex formation. In another embodiment, complex 
formation of target molecule/binding partner can be compared to complex formation of mutant target 
molecule/binding partner (e.g., amino acid modifications in a target polypeptide). Such a comparison can 
be important in those cases where it is desirable to identify test molecules that modulate interactions of 
mutant but not non-mutated target gene products. 

[0162] The assays can be conducted in a heterogeneous or homogeneous format. In heterogeneous 
assays, target molecule and/or the binding partner are immobilized to a solid phase, and complexes are 
detected on the solid phase at the end of the reaction. In homogeneous assays, the entire reaction is 
carried out in a liquid phase. In either approach, the order of addition of reactants can be varied to obtain 
different information about the molecules being tested. For example, test compounds that agonize target 
molecule/binding partner interactions can be identified by conducting the reaction in the presence of the 
test molecule in a competition format. Alternatively, test molecules that agonize preformed complexes, 
e.g., molecules with higher binding constants that displace one of the components from the complex, can 
be tested by adding the test compound to the reaction mixture after complexes have been formed. 
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[0163] In a heterogeneous assay embodiment, the target molecule or the binding partner is anchored 
onto a solid surface (e.g., a microtiter plate), while the non-anchored species is labeled, either directly or 
indirectly. The anchored molecule can be immobilized by non-covalent or covalent attachments. 
Alternatively, an immobilized antibody specific for the molecule to be anchored can be used to anchor 
the molecule to the solid surface. The partner of the immobilized species is exposed to the coated surface 
with or without the test molecule. After the reaction is complete, unreacted components are removed 
(e.g., by washing) such that a significant portion of any complexes formed will remain immobilized on 
the solid surface. Where the non-immobilized species is pre-labeled, the detection of label immobilized 
on the surface is indicative of complex. Where the non-immobilized species is not pre-labeled, an 
indirect label can be used to detect complexes anchored to the surface; e.g., by using a labeled antibody 
specific for the initially non-immobilized species. Depending upon the order of addition of reaction 
components, test compounds that inhibit complex formation or that disrupt preformed complexes can be 
detected. 

[0164] In another embodiment, the reaction can be conducted in a liquid phase in the presence or 
absence of test molecule, where the reaction products are separated from unreacted components, and the 
complexes are detected (e.g., using an immobilized antibody specific for one of the binding components 
to anchor any complexes formed in solution, and a labeled antibody specific for the other partner to 
detect anchored complexes). Again, depending upon the order of addition of reactants to the liquid 
phase, test compounds that inhibit complex or that disrupt preformed complexes can be identified. 

[0165] In an alternate embodiment, a homogeneous assay can be utilized. For example, a preformed 
complex of the target gene product and the interactive cellular or extracellular binding partner product is 
prepared. One or both of the target molecule or binding partner is labeled, and the signal generated by 
the label(s) is quenched upon complex formation (e.g., U.S. Patent No. 4,109,496 that utilizes this 
approach for immunoassays). Addition of a test molecule that competes with and displaces one of the 
species from the preformed complex will result in the generation of a signal above background. In this 
way, test substances that disrupt target molecule/binding partner complexes can be identified. 

[0166] Candidate therapeutics for treating osteoarthritis are identified from a group of test molecules 
that interact with a target molecule. Test molecules are normally ranked according to the degree with 
which they modulate (e.g., agonize or antagonize) a function associated with the target molecule (e.g., 
DNA replication and/or processing, RNA transcription and/or processing, polypeptide production and/or 
processing, and/or biological function/activity), and then top ranking modulators are selected. Also, 
pharmacogenomic information described herein can determine the rank of a modulator. The top 10% of 
ranked test molecules often are selected for further testing as candidate therapeutics, and sometimes the 
top 15%, 20%, or 25% of ranked test molecules are selected for further testing as candidate therapeutics. 
Candidate therapeutics typically are formulated for administration to a subject. 
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Therapeutic Formulations 

|0167] Formulations and pharmaceutical compositions typically include in combination with a 
pharmaceutically acceptable carrier one or more target molecule modulators. The modulator often is a 
test molecule identified as having an interaction with a target molecule by a screening method described 
above. The modulator may be a compound, an antisense nucleic acid, a ribozyme, an antibody, or a 
binding partner. Also, formulations may comprise a target polypeptide or fragment thereof in 
combination with a pharmaceutically acceptable carrier. 

[0168] Formulations or pharmaceutical compositions typically include in combination with a 
pharmaceutically acceptable carrier, a compound, an antisense nucleic acid, a ribozyme, an antibody, a 
binding partner that interacts with an ADAMTS2 polypeptide, a ADAMTS2 nucleic acid, or a fragment 
thereof. The formulated molecule may be one that is identified by a screening method described above. 
Also, formulations may comprise a ADAMTS2 polypeptide or fragment thereof, where the ADAMTS2 
polypeptide contains an isoleucine at position 245 of SEQ ID NO: 4, and a pharmaceutically acceptable 
carrier. Also, formulations may comprise an active ADAMTS2 polypeptide or fragment thereof, where 
ADAMTS2 polypeptide fragments having activity are selected from amino acids 252-121 1 , 253-121 1, 
254-1211, 255-1211, 256-1211, 257-1211, 258-1211, 259-1211 or 260-1211 of SEQ ID NO: 4, where it 
is understood that the active form of ADAMTS2 does not contain the propeptide domain. As used herein, 
the term "pharmaceutically acceptable carrier" includes solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. Supplementary active compounds can also be incorporated into the 
compositions. 

[0169] As used herein, the term "pharmaceutically acceptable carrier" includes solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Supplementary active compounds can also be 
incorporated into the compositions. Pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

[0170] A pharmaceutical composition typically is formulated to be compatible with its intended 
route of administration. Examples of routes of administration include parenteral, e.g., intravenous, 
intradermal, subcutaneous, oral {e.g., inhalation), transdermal (topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous application 
can include the following components: a sterile diluent such as water for injection, saline solution, fixed 
oils, polyethylene glycols, glycerin, propylene glycol or other synthetic solvents; antibacterial agents 
such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; 
chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with 
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acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic. 

[0171] Oral compositions generally include an inert diluent or an edible earner. For the purpose of 
oral therapeutic administration, the active compound can be incorporated with excipients and used in the 
form of tablets, troches, or capsules, e.g., gelatin capsules. Oral compositions can also be prepared using 
a fluid carrier for use as a mouthwash. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant such as magnesium stearate 
or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; 
or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

[0172] Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline 
(PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It should be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can 
be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of 
the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to 
include isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in 
the composition. Prolonged absorption of the injectable compositions can be brought about by including 
in the composition an agent which delays absorption, for example, aluminum monostearate and gelatin. 

[0173] Sterile injectable solutions can be prepared by incorporating the active compound in the 
required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the 
active compound into a sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and freeze-drying which 
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yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile- 
filtered solution thereof. 

[0174] For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propellant, e.g., a gas such as 
carbon dioxide, or a nebulizer. 

[0175] Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are 
used in the formulation. Such penetrants are generally known in the art, and include, for example, for 
transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. Molecules can also be prepared in the form of suppositories (e.g., with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

[0176] In one embodiment, active molecules are prepared with carriers that will protect the 
compound against rapid elimination from the body, such as a controlled release formulation, including 
implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, 
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and 
polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. 
Materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutical^ acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. Patent 
No. 4,522,811. 

[0177] It is advantageous to formulate oral or parenteral compositions in dosage unit form for ease 
of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete 
units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity 
of active compound calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. 

[0178] Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD 50 (the 
dose lethal to 50% of the population) and the ED 50 (the dose therapeutically effective in 50% of the 
population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio LD 50 /ED 5 o- Molecules which exhibit high therapeutic indices are preferred. While 
molecules that exhibit toxic side effects may be used, care should be taken to design a delivery system 
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that targets such compounds to the site of affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects. 

[0179] The data obtained from the cell culture assays and animal studies can be used in formulating 
a range of dosage for use in humans. The dosage of such molecules lies preferably within a range of 
circulating concentrations that include the ED 50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration utilized. For any 
molecules used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC 50 (i.e., the concentration of the test compound which 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in humans. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

[0180] As defined herein, a therapeutically effective amount of protein or polypeptide (i.e., an 
effective dosage) ranges from about 0.001 to 30 mg/kg body weight, sometimes about 0.01 to 25 mg/kg 
body weight, often about 0.1 to 20 mg/kg body weight, and more often about 1 to 10 mg/kg, 2 to 9 
mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to 6 mg/kg body weight. The protein or polypeptide can be 
administered one time per week for between about 1 to 10 weeks, sometimes between 2 to 8 weeks, often 
between about 3 to 7 weeks, and more often for about 4, 5, or 6 weeks. The skilled artisan will 
appreciate that certain factors may influence the dosage and timing required to effectively treat a subject, 
including but not limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject with a 
therapeutically effective amount of a protein, polypeptide, or antibody can include a single treatment or, 
preferably, can include a series of treatments. 

[0181] With regard to polypeptide formulations, featured herein is a method for treating 
osteoarthritis in a subject, which comprises contacting one or more cells in the subject with a first 
polypeptide, where the subject comprises a second polypeptide having one or more polymorphic 
variations associated with cancer, and where the first polypeptide comprises fewer polymorphic 
variations associated with cancer than the second polypeptide. The first and second polypeptides are 
encoded by a nucleic acid which comprises a nucleotide sequence in SEQ ID NO: 1-3; a nucleotide 
sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a nucleotide 
sequence referenced in SEQ ID NO: 1-3; a nucleotide sequence which encodes a polypeptide that is 90% 
or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ED NO: 1-3 and a 
nucleotide sequence 90% or more identical to a nucleotide sequence in SEQ ID NO: 1-3. The subject 
often is a human. 
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|0182] For antibodies, a dosage of 0. 1 mg/kg of body weight (generally 10 mg/kg to 20 mg/kg) is 
often utilized. If the antibody is to act in the brain, a dosage of 50 mg/kg to 100 mg/kg is often 
appropriate. Generally, partially human antibodies and fully human antibodies have a longer half-life 
within the human body than other antibodies. Accordingly, lower dosages and less frequent 
administration is often possible. Modifications such as lipidation can be used to stabilize antibodies and 
to enhance uptake and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is 
described by Cruikshank et al., J. Acquired Immune Deficiency Syndromes and Human Retrovirology 
74:193 (1997). 

[0183] Antibody conjugates can be used for modifying a given biological response, the drug moiety 
is not to be construed as limited to classical chemical therapeutic agents. For example, the drug moiety 
may be a protein or polypeptide possessing a desired biological activity. Such proteins may include, for 
example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a polypeptide such as 
tumor necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet derived growth 
factor, tissue plasminogen activator; or, biological response modifiers such as, for example, lymphokines, 
interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 ("EL-6"), granulocyte macrophage colony 
stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or other growth 
factors. Alternatively, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

[0184] For compounds, exemplary doses include milligram or microgram amounts of the compound 
per kilogram of subject or sample weight, for example, about 1 microgram per kilogram to about 500 
milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams per kilogram, or 
about 1 microgram per kilogram to about 50 micrograms per kilogram. It is understood that appropriate 
doses of a small molecule depend upon the potency of the small molecule with respect to the expression 
or activity to be modulated. When one or more of these small molecules is to be administered to an 
animal (e.g., a human) in order to modulate expression or activity of a polypeptide or nucleic acid 
described herein, a physician, veterinarian, or researcher may, for example, prescribe a relatively low 
dose at first, subsequently increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend upon a variety of 
factors including the activity of the specific compound employed, the age, body weight, general health, 
gender, and diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be modulated. 

[0185] With regard to nucleic acid formulations, gene therapy vectors can be delivered to a subject 
by, for example, intravenous injection, local administration (see, e.g., U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen et al., (1994) Proc. Natl. Acad. Sci. USA 97:3054-3057). 
Pharmaceutical preparations of gene therapy vectors can include a gene therapy vector in an acceptable 
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diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells 
(e.g., retroviral vectors) the pharmaceutical preparation can include one or more cells which produce the 
gene delivery system. Examples of gene delivery vectors are described herein. 

Therapeutic Methods 

[0186] A therapeutic formulation described above can be administered to a subject in need of a 
therapeutic for inducing a desired biological response.. Therapeutic formulations can be administered by 
any of the paths described herein. With regard to both prophylactic and therapeutic methods of 
treatment, such treatments may be specifically tailored or modified, based on knowledge obtained from 
pharmacogenomic analyses described herein. 

[0187] As used herein, the term "treatment" is defined as the application or administration of a 
therapeutic formulation to a subject, or application or administration of a therapeutic agent to an isolated 
tissue or cell line from a subject with the purpose to cure, heal, alleviate, relieve, alter, remedy, 
ameliorate, improve or affect osteoarthritis, symptoms of osteoarthritis or a predisposition towards 
osteoarthritis. A therapeutic formulation includes, but is not limited to, small molecules, peptides, 
antibodies, ribozymes and antisense oligonucleotides. Administration of a therapeutic formulation can 
occur prior to the manifestation of symptoms characteristic of osteoarthritis, such that osteoarthritis is 
prevented or delayed in its progression. The appropriate therapeutic composition can be determined 
based on screening assays described herein. 

[0188] As discussed, successful treatment of osteoarthritis can be brought about by techniques that 
serve to agonize target molecule expression or function, or alternatively, antagonize target molecule 
expression or function. These techniques include administration of modulators that include, but are not 
limited to, small organic or inorganic molecules; antibodies (including, for example, polyclonal, 
monoclonal, humanized, anti-idiotypic, chimeric or single chain antibodies, and Fab, F(ab') 2 and Fab 
expression library fragments, scFV molecules, and epitope-binding fragments thereof); and peptides, 
phosphopeptides, or polypeptides. 

[0189] Further, antisense and ribozyme molecules that inhibit expression of the target gene can also 
be used to reduce the level of target gene expression, thus effectively reducing the level of target gene 
activity. Still further, triple helix molecules can be utilized in reducing the level of target gene activity. 
Antisense, ribozyme and triple helix molecules are discussed above. It is possible that the use of 
antisense, ribozyme, and/or triple helix molecules to reduce or inhibit mutant gene expression can also 
reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of mRNA 
produced by normal target gene alleles, such that the concentration of normal target gene product present 
can be lower than is necessary for a normal phenotype. In such cases, nucleic acid molecules that encode 
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and express target gene polypeptides exhibiting normal target gene activity can be introduced into cells 
via gene therapy method. Alternatively, in instances in that the target gene encodes an extracellular 
polypeptide, it can be preferable to co-administer normal target gene polypeptide into the cell or tissue in 
order to maintain the requisite level of cellular or tissue target gene activity. 

[0190) Another method by which nucleic acid molecules may be utilized in treating or preventing 
osteoarthritis is use of aptamer molecules specific for target molecules. Aptamers are nucleic acid 
molecules having a tertiary structure which permits them to specifically bind to ligands {see, e.g., 
Osborne, et ai, Curr. Opin. Chem. Bioll(l): 5-9 (1997); and Patel, D. J., Curr. Opin. Chem. Biol. 
Jun;l(l): 32-46(1997)). 

[0191] Yet another method of utilizing nucleic acid molecules for osteoarthritis treatment is gene 
therapy, which can also be referred to as allele therapy. Provided herein is a gene therapy method for 
treating osteoarthritis in a subject, which comprises contacting one or more cells in the subject or from 
the subject with a nucleic acid having a first nucleotide sequence (e.g., the first nucleotide sequence is 
identical to or substantially identical to a nucleotide sequence of SEQ ID NO: 1-3). Genomic DNA in 
the subject comprises a second nucleotide sequence having one or more polymorphic variations 
associated with osteoarthritis (e.g., the second nucleotide sequence is identical to or substantially 
identical to a nucleotide sequence of SEQ ID NO: 1). The first and second nucleotide sequences 
typically are substantially identical to one another, and the first nucleotide sequence comprises fewer 
polymorphic variations associated with osteoarthritis than the second nucleotide sequence. The first 
nucleotide sequence may comprise a gene sequence that encodes a full-length polypeptide or a fragment 
thereof. The subject is often a human. Allele therapy methods often are utilized in conjunction with a 
method of first determining whether a subject has genomic DNA that includes polymorphic variants 
associated with osteoarthritis. 

[0192] In another allele therapy embodiment, provided herein is a method which comprises 
contacting one or more cells in the subject or from the subject with a polypeptide encoded by a nucleic 
acid having a first nucleotide sequence (e.g., the first nucleotide sequence is identical to or substantially 
identical to the nucleotide sequence of SEQ ID NO: 1-3). Genomic DNA in the subject comprises a 
second nucleotide sequence having one or more polymorphic variations associated with osteoarthritis 
(e.g., the second nucleotide sequence is identical to or substantially identical to a nucleotide sequence of 
SEQ ID NO: 1). The first and second nucleotide sequences typically are substantially identical to one 
another, and the first nucleotide sequence comprises fewer polymorphic variations associated with 
osteoarthritis than the second nucleotide sequence. The first nucleotide sequence may comprise a gene 
sequence that encodes a full-length polypeptide or a fragment thereof. The subject is often a human. The 
method often comprises supplementing arthritis-associated ADAMTS2 polypeptide with a non-arthritis- 
associated ADAMTS2 polypeptide or fragment thereof, where the non-arthritis-associated form of 
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ADAMTS2 contains an isoleucine at position 245 of SEQ ID NO: 4 having enzymatic activity. The 
arthritis-associated ADAMTS2 polypeptide sometimes contains a valine at position 245 of SEQ ID NO: 4 
having an altered enzymatic activity varying from the non-arthritis-associated polypeptide. 

[0193] In an embodiment, provided is a method of increasing the synthesis of procollagen II 
comprising providing or administering to individuals in need of increasing levels of type II collagen the 
pharmaceutical or physiologically acceptable composition comprising active human ADAMTS2 protein 
or fragment thereof, where ADAMTS2 polypeptide fragments having activity are selected from amino 
acids 252-121 1,253-121 1,254-121 1,255-1211, 256-1211,257-1211, 258-1211,259-1211 or 260-1211 
of SEQ ID NO: 4, where it is understood that the active form of ADAMTS2 does not contain the 
propeptide domain. 

[0194] In another embodiment, provided herein is a method of increasing the synthesis of 
procollagen II comprising providing or administering to individuals in need of increasing levels of type II 
collagen the pharmaceutical or physiologically acceptable composition comprising an enzyme or 
molecule capable of cleaving ADAMTS2 propeptide, e.g., a funn-type endopeptidase or N- 
ethylmaleimide described herein 

[0195| For antibody-based therapies, antibodies can be generated that are both specific for target 
molecules and that reduce target molecule activity. Such antibodies may be administered in instances 
where antagonizing a target molecule function is appropriate for the treatment of osteoarthritis. 

[0196] In circumstances where stimulating antibody production in an animal or a human subject by 
injection with a target molecule is harmful to the subject, it is possible to generate an immune response 
against the target molecule by use of anti-idiotypic antibodies {see, e.g., Herlyn, Ann. Med.;31(l): 66-78 
(1999); and Bhattacharya-Chatterjee & Foon, Cancer Treat. Res.; 94: 51-68 (1998)). Introducing an 
anti-idiotypic antibody to a mammal or human subject often stimulates production of anti-anti-idiotypic 
antibodies, which typically are specific to the target molecule. Vaccines directed to osteoarthritis also 
may be generated in this fashion. 

[0197] In instances where the target molecule is intracellular and whole antibodies are used, 
internalizing antibodies may be preferred. Lipofectin or liposomes can be used to deliver the antibody or 
a fragment of the Fab region that binds to the target antigen into cells. Where fragments of the antibody 
are used, the smallest inhibitory fragment that binds to the target antigen is preferred. For example, 
peptides having an amino acid sequence corresponding to the Fv region of the antibody can be used. 
Alternatively, single chain neutralizing antibodies that bind to intracellular target antigens can also be 
administered. Such single chain antibodies can be administered, for example, by expressing nucleotide 
sequences encoding single-chain antibodies within the target cell population (see, e.g., Marasco et al., 
Proc. Natl. Acad. Sci. USA 90: 7889-7893 (1993)). 
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[0198| Modulators can be administered to a patient at therapeutically effective doses to treat 
osteoarthritis. A therapeutically effective dose refers to an amount of the modulator sufficient to result in 
amelioration of symptoms of osteoarthritis. Toxicity and therapeutic efficacy of modulators can be 
determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically 
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio LD50/ED50. Modulators that exhibit large 
therapeutic indices are preferred. While modulators that exhibit toxic side effects can be used, care 
should be taken to design a delivery system that targets such molecules to the site of affected tissue in 
order to minimize potential damage to uninfected cells, thereby reducing side effects. 

[0199] Data obtained from cell culture assays and animal studies can be used in formulating a range 
of dosages for use in humans. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED 50 with little or no toxicity. The dosage can vary within this 
range depending upon the dosage form employed and the route of administration utilized. For any 
compound used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose can be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC 50 {i.e., the concentration of the test compound that 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in humans. Levels in plasma can be measured, for 
example, by high performance liquid chromatography. 

[0200J Another example of effective dose determination for an individual is the ability to directly 
assay levels of "free" and "bound" compound in the serum of the test subject. Such assays may utilize 
antibody mimics and/or "biosensors" that have been created through molecular imprinting techniques. 
Molecules that modulate target molecule activity are used as a template, or "imprinting molecule", to 
spatially organize polymerizable monomers prior to their polymerization with catalytic reagents. The 
subsequent removal of the imprinted molecule leaves a polymer matrix which contains a repeated 
"negative image" of the compound and is able to selectively rebind the molecule under biological assay 
conditions. A detailed review of this technique can be seen in Ansell et al., Current Opinion in 
Biotechnology 7: 89-94 (1996) and in Shea, Trends in Polymer Science 2: 166-173 (1994). Such 
"imprinted" affinity matrixes are amenable to ligand-binding assays, whereby the immobilized 
monoclonal antibody component is replaced by an appropriately imprinted matrix. An example of the 
use of such matrixes in this way can be seen in Vlatakis, et al.. Nature 361: 645-647 (1993). Through the 
use of isotope-labeling, the "free" concentration of compound which modulates target molecule 
expression or activity readily can be monitored and used in calculations of IC 50 . Such "imprinted" 
affinity matrixes can also be designed to include fluorescent groups whose photon-emitting properties 
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measurably change upon local and selective binding of target compound. These changes readily can be 
assayed in real time using appropriate fiberoptic devices, in turn allowing the dose in a test subject to be 
quickly optimized based on its individual IC S0 - An example of such a "biosensor" is discussed in Kriz et 
a/., Analytical Chemistry 67: 2142-2144 (1995). 

[0201] The examples set forth below are intended to illustrate but not limit the invention. 

Examples 

[0202] In the following studies a group of subjects was selected according to specific parameters 
relating to osteoarthritis. Nucleic acid samples obtained from individuals in the study group were 
subjected to genetic analysis, which identified associations between osteoarthritis and a polymorphism in 
the ADAMTS2 gene on chromosome five. The polymorphism was genotyped again in two replication 
cohorts consisting of individuals selected for OA. In addition, SNPs proximal to the incident 
polymorphism were identified and allelotyped in OA case and control pools. Methods are described for 
producing ADAMTS2 polypeptide and ADAMTS2 polypeptide variants in vitro or in vivo, ADAMTS2 
nucleic acids or polypeptides and variants thereof are utilized for screening test molecules for those that 
interact with ADAMTS2 molecules. Test molecules identified as interactors with ADAMTS2 molecules 
and ADAMTS2 variants are further screened in vivo to determine whether they treat osteoarthritis. 

Example 1 
Samples and Pooling Strategies 

Sample Selection 

[0203] Blood samples were collected from individuals diagnosed with knee osteoarthritis, which 
were referred to as case samples. Also, blood samples were collected from individuals not diagnosed 
with knee osteoarthritis as gender and age-matched controls. A database was created that listed all 
phenotypic trait information gathered from individuals for each case and control sample. Genomic DNA 
was extracted from each of the blood samples for genetic analyses. 

DNA Extraction from Blood Samples > 
[0204] Six to ten milliliters of whole blood was transferred to a 50 ml tube containing 27 ml of red 
cell lysis solution (RCL). The tube was inverted until the contents were mixed. Each tube was incubated 
for 10 minutes at room temperature and inverted once during the incubation. The tubes were then 
centrifuged for 20 minutes at 3000 x g and the supernatant was carefully poured off. 100-200 pi of 
residual liquid was left in the tube and was pipetted repeatedly to resuspend the pellet in the residual 
supernatant. White cell lysis solution (WCL) was added to the tube and pipetted repeatedly until 
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completely mixed. While no incubation was normally required, the solution was incubated at 37°C or 
room temperature if cell clumps were visible after mixing until the solution was homogeneous. 2 ml of 
protein precipitation was added to the cell lysate. The mixtures were vortexed vigorously at high speed 
for 20 sec to mix the protein precipitation solution uniformly with the cell lysate, and then centrifuged for 
10 minutes at 3000 x g. The supernatant containing the DNA was then poured into a clean 15 ml tube, 
which contained 7 ml of 100% isopropanol. The samples were mixed by inverting the tubes gently until 
white threads of DNA were visible. Samples were centrifuged for 3 minutes at 2000 x g and the DNA 
was visible as a small white pellet. The supernatant was decanted and 5 ml of 70% ethanol was added to 
each tube. Each tube was inverted several times to wash the DNA pellet, and then centrifuged for 1 
minute at 2000 x g. The ethanol was decanted and each tube was drained on clean absorbent paper. The 
DNA was dried in the tube by inversion for 10 minutes, and then 1000 \x\ of IX TE was added. The size 
of each sample was estimated, and less TE buffer was added during the following DNA hydration step if 
the sample was smaller. The DNA was allowed to rehydrate overnight at room temperature, and DNA 
samples were stored at 2-8°C. 

[0205] DNA was quantified by placing samples on a hematology mixer for at least 1 hour. DNA 
was serially diluted (typically 1:80, 1:160, 1:320, and 1:640 dilutions) so that it would be within the 
measurable range of standards. 125 u.1 of diluted DNA was transferred to a clear U-bottom microtitre 
plate, and 125 u,l of IX TE buffer was transferred into each well using a multichannel pipette. The DNA 
and IX TE were mixed by repeated pipetting at least 15 times, and then the plates were sealed. 50 u.1 of 
diluted DNA was added to wells A5-H12 of a black flat bottom microtitre plate. Standards were inverted 
six times to mix them, and then 50 u.1 of 1 X TE buffer was pipetted into well Al , 1 000 ng/ml of standard 
was pipetted into well A2, 500 ng/ml of standard was pipetted into well A3, and 250 ng/ml of standard 
was pipetted into well A4. PicoGreen (Molecular Probes, Eugene, Oregon) was thawed and freshly 
diluted 1 :200 according to the number of plates that were being measured. PicoGreen was vortexed and 
then 50(0.1 was pipetted into all wells of the black plate with the diluted DNA. DNA and PicoGreen were 
mixed by pipetting repeatedly at least 10 times with the multichannel pipette. The plate was placed into a 
Fluoroskan Ascent Machine (microplate fluorometer produced by Labsystems) and the samples were 
allowed to incubate for 3 minutes before the machine was run using filter pairs 485 nm excitation and 
538 nm emission wavelengths. Samples having measured DNA concentrations of greater than 450 ng/p.1 
were re-measured for conformation. Samples having measured DNA concentrations of 20 ng/pl or less 
were re-measured for confirmation. 
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Pooling Strategies - Discovery Cohort 

[0206] Samples were derived from the Nottingham knee OA family study (UK) where index cases 
were identified through a knee replacement registry. Siblings were approached and assessed with knee x- 
rays and assigned status as affected or unaffected. In all 1,157 individuals were available. In order to 
create same-sex pools of appropriate sizes, 335 unrelated female individuals with OA from the 
Nottingham OA sample were selected for the case pool. The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 18 to 76 years of age. 
The female case samples and female control samples are described further in Table 1 below. 

[0207] A select set of samples from each group were utilized to generate pools, and one pool was 
created for each group. Each individual sample in a pool was represented by an equal amount of 
genomic DNA. For example, where 25 ng of genomic DNA was utilized in each PCR reaction and there 
were 200 individuals in each pool, each individual would provide 125 pg of genomic DNA. Inclusion or 
exclusion of samples for a pool was based upon the following criteria: the sample was derived from an 
individual characterized as Caucasian; the sample was derived from an individual of British paternal and 
maternal descent; case samples were derived from individuals diagnosed with specific knee osteoarthritis 
(OA) and were recruited from an OA knee replacement clinic. Control samples were derived from 
individuals free of OA, family history of OA, and rheumatoid arthritis. Also, sufficient genomic DNA 
was extracted from each blood sample for all allelotyping and genotypmg reactions performed during the 
study. Phenotype information from each individual was collected and included age of the individual, 
gender, family history of OA, general medical information (e.g., height, weight, thyroid disease, 
diabetes, psoriasis, hysterectomy), joint history (previous and current symptoms, joint-related operations, 
age at onset of symptoms, date of primary diagnosis, age of individual as of primary diagnosis and order 
of involvement), and knee-related findings (crepitus, restricted passive movement, bony 
swelling/deformity). Additional knee information included knee history, current symptoms, any major 
knee injury, menisectomy, knee replacement surgery, age of surgery, and treatment history (including 
hormone replace therapy (HRT)). Samples that met these criteria were added to appropriate pools based 
on disease status. 
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[0208] The selection process yielded the pools set forth in Table 1, which were used in the studies 
that follow: 

TABLE 1 





Female case 


Female control 


Pool size 
(Number) 


335 


335 


Pool Criteria 

(ex: case/control) 


control 


case 


Mean Age 

(ex: years) 


57.21 


69.95 



Example 2 

Association of Polymorphic Variants with Osteoarthritis 

[0209] A whole-genome screen was performed to identify particular SNPs associated with 
occurrence of osteoarthritis. As described in Example 1, two sets of samples were utilized, which 
included samples from female individuals having knee osteoarthritis (osteoarthritis cases), and samples 
from female individuals not having knee osteoarthritis (female controls). The initial screen of each pool 
was performed in an allelotyping study, in which certain samples in each group were pooled. By pooling 
DNA from each group, an allele frequency for each SNP in each group was calculated. These allele 
frequencies were then compared to one another. Particular SNPs were considered as being associated 
with osteoarthritis when allele frequency differences calculated between case and control pools were 
statistically significant. SNP disease association results obtained from the allelotyping study were then 
validated by genotyping each associated SNP across all samples from each pool. The results of the 
genotyping then were analyzed, allele frequencies for each group were calculated from the individual 
genotyping results, and a p-value was calculated to determine whether the case and control groups had 
statistically significant differences in allele frequencies for a particular SNP. When the genotyping 
results agreed with the original allelotyping results, the SNP disease association was considered validated 
at the genetic level. 

SNP Panel Used for Genetic Analyses 
[0210] A whole-genome SNP screen began with an initial screen of approximately 25,000 SNPs 
over each set of disease and control samples using a pooling approach. The pools studied in the screen 
are described in Example 1. The SNPs analyzed in this study were part of a set of 25,488 SNPs 
confirmed as being statistically polymorphic as each is characterized as having a minor allele frequency 
of greater than 10%. The SNPs in the set reside in genes or in close proximity to genes, and many reside 
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in gene exons. Specifically, SNPs in the set are located in exons, introns, and within 5,000 base-pairs 
upstream of a transcription start site of a gene. In addition, SNPs were selected according to the 
following criteria: they are located in ESTs; they are located in Locuslink or Ensembl genes; and they are 
located in Genomatix promoter predictions. SNPs in the set were also selected on the basis of even 
spacing across the genome, as depicted in Table 2. 

[0211] A case-control study design using a whole genome association strategy involving 
approximately 28,000 single nucleotide polymorphisms (SNPs) was employed. Approximately 25,000 
SNPs were evenly spaced in gene-based regions of the human genome with a median inter-marker 
distance of about 40,000 base pairs. Additionally, approximately 3,000 SNPs causing amino acid 
substitutions in genes described in the literature as candidates for various diseases were used. The case- 
control study samples were of female Caucasian origin (British paternal and maternal descent) 670 
individuals were equally distributed in two groups: female controls and female cases. The whole genome 
association approach was first conducted on 2 DNA pools representing the 2 groups. Significant markers 
were confirmed by individual genotyping. 



TABLE 2 



General Statistics 


Spacing Statistics 


Total # of SNPs 


25,488 


Median 


37,058 bp 


#ofExonic SNPs 


>4,335(17%) 


Minimum* 


1,000 bp 


# SNPs with refSNP ID 


20,776 (81%) 


Maximum* 


3,000,000 bp 


Gene Coverage 


>10,000 


Mean 


122,412 bp 


Chromosome Coverage 


All 


Std Deviation 


373,325 bp 






*Excludes outliers 





Allelotvping and Genotyping Results 
[0212] The genetic studies summarized above and described in more detail below identified an 
allelic variant in the ADAMTS2 gene that is associated with osteoarthritis. 

Assay for Verifying, Allelotvping, and Genotyping SNPs 
[0213] A MassARRAY™ system (Sequenom, Inc.) was utilized to perform SNP genotyping in a 
high-throughput fashion. This genotyping platform was complemented by a homogeneous, single-tube 
assay method (hME™ or homogeneous MassEXTEND™ (Sequenom, Inc.)) in which two genotyping 
primers anneal to and amplify a genomic target surrounding a polymorphic site of interest. A third 
primer (the MassEXTEND™ primer), which is complementary to the amplified target up to but not 



62 



PATENT 

Docket 524593008900 



including the polymorphism, was then enzymatically extended one or a few bases through the 
polymorphic site and then terminated. 

[0214] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND™ primer which where used to genotype the 
polymorphism. Other primer design software could be used or one of ordinary skill in the art could 
manually design primers based on his or her knowledge of the relevant factors and considerations in 
designing such primers. Table 3 shows PCR primers and Table 4 shows extension primers used for 
analyzing polymorphisms. The initial PCR amplification reaction was performed in a 5 u.1 total volume 
containing IX PCR buffer with 1 .5 mM MgCl 2 (Qiagen), 200 uM each of dATP, dGTP, dCTP, dTTP 
(Gibco-BRL), 2.5 ng of genomic DNA, 0.1 units of HotStar DNA polymerase (Qiagen), and 200 nM 
each of forward and reverse PCR primers specific for the polymorphic region of interest. 



TABLE 3: PCR Primers 



SNP 
Reference 


Forward PCR primer 


Reverse PCR primer 


rs398829 


ACGTTGGATGTAGTCATCGTCCGCAGCATG 


ACGTTGGATGAAGACGGTGTCCTCTCCTTG 



[0215] Samples were incubated at 95°C for 15 minutes, followed by 45 cycles of 95°C for 20 
seconds, 56°C for 30 seconds, and 72°C for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 units in a 2 pi volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 pi) to remove any residual dNTPs 
that were not consumed in the PCR step. Samples were incubated for 20 minutes at 37°C, followed by 5 
minutes at 85°C to denature the SAP. 

[0216] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
used to distinguish polymorphic alleles from one another. Methods for verifying, allelotyping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference. In Table 4, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 



TABLE 4: Extension Primers 



SNP 
Reference 



Termination 
Mix 



TGGCGTGCTCCTCTAGGA 
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[0217] The MassEXTEND™ reaction was performed in a total volume of 9 ul, with the addition of 
IX ThermoSequenase buffer, 0.576 units of ThermoSequenase (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used in the assay normally was 
complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52°C, and 5 seconds 
at 72°C. 

[0218] Following incubation, samples were desalted by adding 16 ul of water (total reaction volume 
was 25 ul), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHIP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 



Genetic Analysis 

[0219] Minor allelic frequencies for the polymorphisms set forth in Table A were verified as being 
10% or greater using the extension assay described above in a group of samples isolated from 92 
individuals originating from the state of Utah in the United States, Venezuela and France (Coriell cell 
repositories). 

[0220] Genotyping results are shown for female pools in Table 5. In Table 5, "AF" refers to allelic 
frequency; and "F case" and "F control" refer to female case and female control groups, respectively. 

TABLE 5: Genotyping Results 



SNP Reference 



[0221] All of the single marker alleles set forth in Table A were considered validated, since the 
genotyping data agreed with the allelotyping data and each SNP significantly associated with 
osteoarthritis. Particularly significant associations with osteoarthritis are indicated by a calculated p- 
value of less than 0.05 for genotype results. 
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Example 3 

Association of Polymorphic Variants with Osteoarthritis in Replication Cohorts 

[0222] The single marker polymorphism set forth in Table A was genotyped again in two replication 
cohorts consisting of individuals selected for OA. 

Sample Selection and Pooling Strategies - Replication Sample 1 
[0223] A second case control sample (replication sample #1) was created by using 100 Caucasian 
female cases from Chingford, UK, and 148 unrelated female cases from the St. Thomas twin study. 
Cases were defined as having Kellgren-Lawrence (KL) scores of at least 2 in at least one knee x-ray. In 
addition, 199 male knee replacement cases from Nottingham were included. (For a cohort description, 
see the Nottingham description provided in Example 1). The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 18 to 76 years of age. 
The replication sample 1 cohort was used to replicate the initial results. Table 6 below summarizes the 
selected phenotype data collected from the case and control individuals. 



TABLE 6 



Phenotype 


Female cases (n=248): 
median (range)/ (n,%) 


Male cases (n=199): 
median (range)/ (n,%) 


Female controls (n=313): 

mean (range)/ (n,%) 


Age 


59 (39- 73) 


66 (45- 73) 


55 (50- 72) 


Height (cm) 


162 (141- 178) 


175 (152- 198) 


162 (141- 176) 


Weight (kg) 


68 (51- 123) 


86 (62-127) 


64(40- 111) 


Body mass index 
(kg/m 2 ) 


26 (18- 44) 


29 (21-41) 


24(18- 46) 


Kellgren- 
Lawrence* left 
knee 


0 (63, 26%), 1 (20, 8%), 2 
(105, 43%), 3 (58, 23%), 4 
(1 , 0%) 


NA 


NA 


Kellgren- 
Lawrence* right 
knee 


0 (43, 7%), 1 (18, 7%), 2 
(127, 52%), 3(57, 23%), 4 
(1 , 0%) 


NA 


NA 


KL* >2 both 
knees 


No (145, 59%), Yes (101, 
41%) 


NA 


NA 


KL* >2 either 
knee 


No (0, 0%), Yes (248, 100%) 


NA 


NA 



* 0: normal, 1 : doubtful, 2: definite osteophyte (bony protuberance), 3: joint space 
narrowing (with or without osteophyte), 4: joint deformity 
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Sample Selection and Pooling Strategies - Replication Sample 2 
|0224] A third case control sample (replication sample #2) was created by using individuals with 
symptoms of OA from Newfoundland, Canada. These individuals were recruited and examined by 
rheumatologists. Affected joints were x-rayed and a final diagnosis of definite or probable OA was made 
according to American College of Rheumatology criteria by a single rheumatologist to avoid any inter- 
examiner diagnosis variability. Controls were recruited from volunteers without any symptoms from the 
musculoskeletal system based on a normal joint exam performed by a rheumatologist. Only cases with a 
diagnosis of definite OA were included in the study. Only individuals of Caucasian origin were included. 
The cases consisted of 228 individuals with definite knee OA, 106 individuals with definite hip OA, and 
74 individuals with hip OA. 



TABLE 7 



Phenotype 


Case 


Control 


Age at Visit 


62.7 


52.5 


Sex (Female/Male) 


227/119 


174/101 


Knee OA Xray: No 
Unknown 
Yes 


35% (120) 


80% (16) 


1%(4) 


0% (0) 


64% (221) 


20% (4) 


Hip OA Xray: No 
Unknown 
Yes 


63% (21 5) 


80% (16) 


2% (7) 


0% (0) 


35% (121) 


20% (4) 



Assay for Verifying. Allelotyping, and Genotyping SNPs 

[0225J Genotyping of the replication cohorts described in Tables 6 and 7 was performed using the 
same methods used for the original genotyping, as described herein. A MassARRAY™ system 
(Sequenom, Inc.) was utilized to perform SNP genotyping in a high-throughput fashion. This genotyping 
platform was complemented by a homogeneous, single-tube assay method (hME™ or homogeneous 
MassEXTEND™ (Sequenom, Inc.)) in which two genotyping primers anneal to and amplify a genomic 
target surrounding a polymorphic site of interest. A third primer (the MassEXTEND™ primer), which is 
complementary to the amplified target up to but not including the polymorphism, was then enzymatically 
extended one or a few bases through the polymorphic site and then terminated. 

[0226] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND™ primer which where used to genotype the 
polymorphism. Other primer design software could be used or one of ordinary skill in the art could 



66 



PATENT 

Docket 524593008900 

manually design primers based on his or her knowledge of the relevant factors and considerations in 
designing such primers. Table 3 shows PCR primers and Table 4 shows extension probes used for 
analyzing (e.g., genotyping) polymorphisms in the replication cohorts. The initial PCR amplification 
reaction was performed in a 5 pi total volume containing IX PCR buffer with 1.5 mM MgCl 2 (Qiagen), 
200 uM each of dATP, dGTP, dCTP, dTTP (Gibco-BRJL), 2.5 ng of genomic DNA, 0.1 units of HotStar 
DNA polymerase (Qiagen), and 200 nM each of forward and reverse PCR primers specific for the 
polymorphic region of interest. 

[0227] Samples were incubated at 95°C for 15 minutes, followed by 45 cycles of 95°C for 20 
seconds, 56°C for 30 seconds, and 72°C for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 units in a 2 pi volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 pi) to remove any residual dNTPs 
that were not consumed in the PCR step. Samples were incubated for 20 minutes at 37°C, followed by 5 
minutes at 85°C to denature the SAP. 

[0228] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
used to distinguish polymorphic alleles from one another. Methods for verifying, allelotyping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference. In Table 7, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 

[0229] The MassEXTEND™ reaction was performed in a total volume of 9 pi, with the addition of 
IX ThermoSequenase buffer, 0.576 units of ThermoSequenase (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used in the assay normally was 
complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52°C, and 5 seconds 
at 72°C. 

[0230] Following incubation, samples were desalted by adding 1 6 pi of water (total reaction volume 
was 25 pi), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHIP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
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SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 

Genetic Analysis 

[0231] Genotyping results for replication cohorts #1 and #2 are provided in Tables 8 and 9, 
respectively. 

TABLE 8 



rsID 


Replication #1 
(Mixed Male/Female cases and Female controls) 

AF OA Con AF OA Cas Delta P-value 


Meta-analysis 
Disc. + Rep #1 


rs398829 


0.30 I 0.28 | 0.02 | 0.307 


0.0260 


TABLE 9 


rsID 


Replication #2 (Newfoundland) 
(Male/Female cases and controls) 

AFOACon AFOACas Delta P-value 


Meta-analysis 
Disc. + Rep #2 
Not Done 


rs398829 


0.27 | 0.28 | -0.013 | 0.627 





[0232] To combine the evidence for association from multiple sample collections, a meta-analysis 
procedure was employed. The allele frequencies were compared between cases and controls within the 
discovery sample, as well as within the replication cohort #1 using the DerSimian-Laird approach 
(DerSimonian, R. and N. Laird. 1986. Meta-analysis in clinical trials. Control Clin Trials 7: 177-188.) 

[0233] The absence of a statistically significant association in one or more of the replication cohorts 
should not be interpreted as minimizing the value of the original finding. There are many reasons why a 
biologically derived association identified in a sample from one population would not replicate in a 
sample from another population. The most important reason is differences in population history. Due to 
bottlenecks and founder effects, there may be common disease predisposing alleles present in one 
population that are relatively rare in another, leading to a lack of association in the candidate region. 
Also, because common diseases such as arthritis-related disorders are the result of susceptibilities in 
many genes and many environmental risk factors, differences in population-specific genetic and 
environmental backgrounds could mask the effects of a biologically relevant allele. For these and other 
reasons, statistically strong results in the original, discovery sample that did not replicate in one or more 
of the replication samples may be further evaluated in additional replication cohorts and experimental 
systems. 
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Example 4 
ADAMTS2 Region Proximal SNPs 

[02341 It has been discovered that SNP rs398829 in ADAMTS2 is associated with occurrence of 
osteoarthritis in subjects. This gene encodes a disintegrin and metalloproteinase with thrombospondin 
motifs-2 (ADAMTS2), which is a member of the ADAMTS protein family. Members of the family share 
several distinct protein modules, including a propeptide region, a metalloproteinase domain, a 
disintegrin-like domain, and a thrombospondin type 1 (TS) motif. ADAMTS2 is involved in collagens 1, 2 
and 5 N-termmal processing, (type II collagen is the major form in cartilage). Mutations in this gene 
cause Ehlers-Danlos syndrome type VIIC, a recessively inherited connective-tissue disorder that causes 
loose joints and fragile skin. Mild loss of function may exacerbate physical joint damage leading to a 
predisposition to OA and incorrectly processed collagen can act dominantly to inhibit self assembly of 
fibrils. Alternative splicing of the gene generates 2 transcript variants. The short transcript encodes a 
protein, which has no significant procollagen N-peptidase activity. 

[0235] Two hundred-nine additional allelic variants proximal to rs398829 were identified and 
subsequently allelotyped in osteoarthritis case and control sample sets as described in Examples 1 and 2. 
The polymorphic variants are set forth in Table 10. The chromosome positions provided in column four 
of Table 10 are based on Genome "Build 34" of NCBI's GenBank. 



TABLE 10 



dbSNP 
rs# 


Chromosome 


Position in SEQ 
ID NO: 1 


Chromosome 
Position 


Allele 
Variants 


rs2278221 


5 


210 


178695460 


c/t 


rs 1650358 


5 


3608 


178698858 


da 


rs 16438 18 


5 


3609 


178698859 


C/Q 


rs3733916 


5 


4318 


178699568 


c/t 


rs1 624933 


5 


5593 


178700843 


a/g 


rs 1624857 


5 


5629 


178700879 


c/t 


rs1 624832 


5 


5639 


178700889 


a/g 


rs 1624829 


5 


5640 


178700890 


c/t 


rs2161171 


5 


8943 


178704193 


a/c 


rs 1530499 


5 


17968 


178713218 


a/g 


rs888764 


5 


19887 


178715137 


a/g 


rs873987 


5 


21034 


178716284 


a/g 


rs4078699 


5 


21085 


178716335 


c/t 


rs87031 1 


5 


21596 


178716846 


a/g 


rs 16438 17 


5 


23379 


178718629 


a/c 


rs 16438 16 


5 


23432 


178718682 


a/c 


rs 1650355 


5 


24007 


178719257 


a/c 


rs888763 


5 


26121 


178721371 


a/g 


rs1862212 


5 


26273 


178721523 


a/t 
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dbSNP 
rs# 


Chromosome 


Position in SEQ 
ID NO: 1 




Chromosome 
Position 


Allele 
Variants 


rs1 1 10514 






178722005 


aA 


rs3797600 








1 78722661 




rso f y t duz 






I for zi^yoU 


- 

g/t 


rs3797603 









27842 


178723092 


c/t 


rs3776819 




28379 


178723629 


c/t 


rs252076 







29603 


178724853 


5/t 


rs252075 






178726482 


c/g 


rs252074 


g 


31504 


178726754 


a/g 


rs252068 


5 


32583 


178727833 


c/g 


rs252069 


5 


32794 


178728044 


a/g 


rs 194040 


5 


32840 


178728090 


c/t 


rs252070 


5 


33044 


178728294 


c/t 


rs3797606 


5 


33150 


178728400 


a/c 


rs1 71667 


5 


33218 


178728468 


a/g 


rs 187539 


5 


33513 


178728763 


c/t 


rs3836834 



5 


33959 


1 78729209 


/tatcaaactaccatga 

aa 


rs2 52071 





34486 f 


178729736 


a/g 






36289 


178731539 


c/t 







36570 


1 78731 820 


c/t 


rso /yooy 





38247 


178733497 


a/t 


rs2052472 







38477 


1 78733727 


a/c 


rs2052471 — 





38518 


1 78733768 


c/t 


rs2052470 


5 


38529 


1 78733779 


c/t 


rs2052469 




38667 


178733917 


a/g 


rs3797608 







39781 


178735031 


c/t 


rs3797609 





39856 


178735106 


c/t 


rs3822601 




39927 


178735177 


c/t 


rs153131 





40506 


178735756 


a/g 


rs751 546 





41869 


1787371 19 


c/g 


rs2279979 





42452 


1 78737702 


c/t 


rs2 52060 





44788 


178740038 


c/t 


rsov y/oiu 





46059 


178741309 


a/c 


rs 194039 







46846 


178742096 


a/g 


rs i do /to 





47712 


1 78742962 


a/t 


rs252061 





48796 


1 78744046 


c/t 


rs1 87537 




49441 


178744691 


c/g 


rs252062 







49602 


1 78744852 


a/t 


rs^4oi ^do 






• 178744973 




rs3797612 


g 


50050 


1 78745300 


c/t 


rs3797613 


5 


50171 


178745421 


c/t 


rs614114 


5 


50477 


178745727 


c/t 


rs252063 


5 


50818 


178746068 


c/t 


rs252064 


5 


50833 


178746083 


c/t 


rs252065 


5 


50881 


178746131 


a/g 


rs450502 


5 


50882 


178746132 


a/g 


rs439252 


5 


51386 


178746636 


c/t 
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dbSNP 
rs# 


Chromosome 




Position in SEQ 
ID NO: 1 


Chromosome 
Position 


Variants 


rc9R9nRr\ 

rszozuDD 





51534 


1 78746784 


c/t 






5231 


1 78747567 


a/g 


rso f y / o i4 







cooro 


1 78747618 


c/t 






o 




1 78748220 


a/g 


rs398829 








1 78748273 


a/g 


rs41 6646 





53356 


1 78748606 


a/g 


rs 187450 





53882 


178749132 


g/t 


rs337807 





54553 


1 78749803 


c/t 


rsoo fouo 





55475 


1 78750725 


a/c 


rs1 396438 





55530 


1 78750780 


a/g 


rs1 396437 





55691 


1 78750941 


c/t 


rs241 181 1 







55848 


1 78751 098 


a/c 







55879 


1 78751 129 


c/g 


rs i oazOD 




56316 


178751 566 


a/g 


rs1 73072 







56911 


178752161 


a/c 


rsoo/ouo 





57320 


178752570 


a/g 


re 1 Q 1 a \ 

rs i y 1 4 1 o 





57391 


178752641 


c/t 


rel Qnnd*\ 

rs iouu4D 





57437 


1 78752687 


c/t 


re 1 OQORC; 

rs I oy^DD 


— 


57478 


178752728 


c/g 


rSDDZ/DD 




57500 


178752750 


crt 


rSHDD/DU 


■ 


59111 


1 78754361 


g/t 


rs^^uo 




5 , 


59333 


178754583 


a/g 






59715 


1 78754965 


a/g 


rs592971 







59804 


178755054 


a/g 


rs457187 





59851 


178755101 


a/g 


rs459490 




59929 


178755179 


c/t 


rs459668 







60052 


1 78755302 


c/t 


rs462646 





60240 


178755490 


c/t 


rs458272 





60359 


178755609 


g/t 


rs463455 




60381 


178755631 


a/g 


rs675880 





60456 


178755706 


c/t 


rs81061 7 





60724 


178755974 


c/g 


rs464156 







60875 


178756125 


c/t 


rs458083 




60968 


178756218 


a/g 


rs467333 







60978 


178756228 


c/g 


rs465381 




60998 


178756248 


c/t 


rs466363 





61557 


178756807 


c/t 


rs2457099 





62091 


178757341 


c/t 


rs463901 







178757895 




rs465621 


^ 


62943 


178758193 


a/c 


rs463724 


5 


63131 


178758381 


a/t 


rs465242 


5 


63145 


178758395 


g/t 


rs467419 


5 


63406 


178758656 


a/g 


rs456135 


5 


63427 


178758677 


c/g 


rs464536 


5 


63554 


178758804 


c/t 


rs461898 


5 


63661 


178758911 


a/g 


rs389558 


5 


64093 


178759343 


a/g 
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Chromosome 
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Position 


Variants 


rs466752 




64153 


178759403 


2rt 







64409 


i /o/DyoDy 


c/g 


rs463435 





64544 


178759794 


c/t 


rs2 174971 





65257 


178760507 


c/t 


rs1 979979 





65626 


178760876 


a/g 


rs41 1804 





65739 


178760989 


a/g 


rs 1623885 





66392 


178761642 


c/t 


rs1 643811 





66720 


178761970 


c/t 


rs434430 





69177 


1 78764427 


a/t 


rs1 87538 





69336 


178764586 


g/t 


rs252067 





69636 


178764886 


a/g 


rs459319 





69823 


178765073 


a/g 


rs467289 





69928 


178765178 


cA 


rs462644 


- 


70547 


178765797 


c/t 


rs458752 





70633 


178765883 


c/t 


rs708320 





71805 


1 78767055 


a/c 


rs457954 





72181 


178767431 


c/g 


rs241 1810 





72200 


178767450 


c/t 


rs3084687 





72474 


178767724 


^at 


rs69638 





72567 


178767817 


c/g 


rs455452 





72973 


178768223 


a/g 


rs464850 





73468 


178768718 


a/g 


rs431472 







73889 


178769139 


a/g 


rs2411809 




75730 


178770980 


c/t 


rs2457094 


- 


75970 


178771220 


a/q 


rs2457095 





76114 


178771364 


a/g 


rs2261740 




5 


76342 


178771592 


c/t 


rs1 109180 




76449 


178771699 


a/g 


rs1109179 




5 


76465 


178771715 


c/t 


rs1109178 


5 


76791 


178772041 


a/c 


rs456909 


5 


78042 


1 78773292 


a/g 


rs469124 




80758 


1 78776008 


a/g 


rs468039 





80778 


178776028 


c/t 


rs467017 





81356 


178776606 


a/c 


rs469290 





81576 


178776826 


a/g 


rs469090 


- 


81689 


178776939 


c/t 


rs469568 





81759 


178777009 


g/t 


rs468386 





81950 


178777200 


c/g 


rs469349 







178777812 




rc^RQnQQ 

rs-^oyuyy 


1 


83591 


\ lot l OO^l 


c/t 


rs456868 


5 


83700 


178778950 


a/g 


rs465389 


5 


83821 


178779071 


c/g 


rs463892 


5 


83842 


178779092 


c/g 


rs468548 


5 


83923 


178779173 


g/t 


rs654612 


5 


83929 


178779179 


a/c 


rs468542 


5 


84021 


178779271 


c/g 


TS469262 


5 


84175 


178779425 


c/t 
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rs708323 




84417 


1 78779667 


a/9 


rs469089 







1 78779997 


c/g 


rs469396 









85746 


178780996 


c/g 


rs468723 




86129 


178781379 


5/1 


rs467604 





86335 


178781585 


a/g 


rs338874 





87315 


1 78782565 


c/g 


rs338875 





87648 


178782898 


a/g 


rs 1385803 





87764 


178783014 


a/c 


rs1 385804 







87770 


178783020 


c/g 


rs338876 




88221 


178783471 


5/1 


rs1 89803 





90474 


1 78785724 


a/c 


rs452215 





91148 


1 78786398 


g/t 


rs641 170 







91150 


178786400 


g/t 


rs584398 





91160 


178786410 


g/t 


rs385330 




91733 ) 


178786983 


5/1 


rs429538 







91772 , 


178787022 


a/c 


rs371229 





91785 


178787035 


c/t 


rs460874 





93140 


178788390 


a/t 


rs646121 




93148 


178788398 


5/1 


rs468262 




5 


96080 


1 78791 330 


§/9 


rs467863 




96157 


178791407 


c/g 


rs191434 





96313 


178791563 


ale 


rs2054782 





96759 


1 78792009 


c/t 


rs468499 





97026 


178792276 


a/c 


rs1 80287 







97320 


178792570 


c/g 


rs338877 




97732 


1 78792982 


a/t 


rs650665 







98713 


178793963 


c/g 


rs193419 




99707 


1 78794957 


a/c 


rs1 80288 







99959 


1 78795209 


c/g 


rs 186834 





100009 


178795259 


a/g 


rs 189266 





1 00020 


178795270 


c/g 


rs 189267 




100065 


178795315 


a/c 


rs1 70937 


- 


100086 


178795336 


c/g 


rs463263 





101270 


178796520 


c/g 


rs463262 





101276 


178796526 


g/t 


rs460454 





101371 


178796621 


c/t 


rs460455 





101376 


1 78796626 


c/g 


rs460505 





10 1 4 . 3 ?- 


178796689 


c/t 


rs931316 




5 


101820 


178797070 


c/t 


rs463431 




1 02392 


1 ~7R~7(X7RAO 
I 1 O f y t Xi^c. 




rs461542 


5 


102602 


178797852 


a/g 


rs463557 


5 


102604 


178797854 


a/c 


rs191453 


5 


102896 


178798146 


c/t 


rs2271212 


5 


189104 


178884354 


c/t 


rs462009 


5 


189134 


178884384 


c/t 


rs2271211 


5 


189205 


178884455 


a/g 


rs396474 


5 


Not mapped 


Not mapped 


a/c 
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Chromosome 
Position 


Allele 
Variants 


rs428901 


5 


Not mapped 


Not mapped 


a/t 


rs452300 


5 


Not mapped 


Not mapped 


g/t 


rs670256 


5 


Not mapped 


Not mapped 


g/t 



Assay for Verifying and Allelotyping SNPs 
[0236] The methods used to verify and allelotype the 209 proximal SNPs of Table 10 are the same 
methods described in Examples 1 and 2 herein. The primers and probes used in these assays are provided 
in Table 1 1 and Table 12, respectively. 



TABLE 11 



dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs2278221 


ACGTTGGATGTCTCATGGGCCACCACAAAC 


ACGTTGGATGTATGCTCCTGTCACCGGCAT 


rs1 650358 


ACGTTGGATGTGGATGGCTCCATGTTCTTG 


ACGTTGGATGAAGTGCTGGGATTACAGGTG 


rs1643818 


ACGTTGGATGCTGGGATTACAGGTGTGAAC 


ACGTTGGATGTGGATGGCTCCATGTTCTTG 


rs3733916 


ACGTTGGATGCCGAGCAGGCTGTAGTGTTG 


ACGTTGGATGCTTTGTACCACCTGGAACAG 


rs1 624933 


ACGTTGGATGAGGCTGGTCTCAAACTCCTG 


AC GTT G G ATGT AAC AAAAAGTTG GCCGTGC 


rs1 624857 


ACGTTGGATGTGAGGTCAG G AGTTTG AG AC 


ACGTTGGATGGCCACCAAGCCAGACTAAGT 


rs1 624832 


ACGTTGGATGTGAGGTCAGGAGTTTGAGAC 


ACGTTGGATGGCCACCAAGCCAGACTAAGT 


rs1 624829 


ACGTTGGATGTGAGGTCAGGAGTTTGAGAC 


ACGTTGGATGGCCACCAAGCCAGACTAAGT 


rs2161171 


ACGTTGGATGCCCGTCACCACTTTATTTCC 


ACGTTGGATGAGAGTGGATCCAGTCTGCAG 


rs 1530499 


ACGTTGGATGACTCCAAGATTTCCCATTTC 


ACGTTGGATGTTCTGTGTTCCACCCTATGG 


rs888764 


ACGTTGGATGTAGTTGAATGTTGTATTGGC 


ACGTTGGATGACCGTGATAAACACAGAATG 


rs873987 


ACGTTGGATGGCTGTTAATCATGTGTCGGG 


ACGTTGGATGATTTGGCCACATCACCAGAC 


rs4078699 


ACGTTGGATGGTACCGTGGATTCTTTTAGG 


ACGTTGGATGGTATTGGAAAAGAGCAGAGAC 


rs870311 


ACGTTGGATGTCAGGGCTCCAGTGTTGAAG 


ACGTTGGATGAAAAGGAGGAGTGCCCTGTG 


rs1643817 


ACGTTGGATGATGGGAAACTCCTGGTCCTG 


ACGTTGGATGAAAATGCAAGCCGCCACCTG 


rs1643816 


ACGTTGGATGTTTTCTCCCCTTTCTAGCCC 


ACGTTGGATGTTGGCATGAGAGATGGACAG 


rs1 650355 


ACGTTGGATGTCAACAGCAACAAAACCAAA 


ACGTTGGATGTTAAATAGGTCAGAGGGTTG 


rs888763 


ACGTTGGATGAAGAGGAAGAGACATACCAG 


ACGTTGGATGAACAACATGGACTCAGGCTG 


rs1862212 


ACGTTGGATGGGCCACATTTTAAAACAAGGG 


ACGTTGGATGTCCCCTGAGGTTCCTATAAG 


rs1 110514 


ACGTTGGATGTGCCCACGTTCCATGTTCAG 


ACGTTGGATGATCACTGTAGCCCCTTCCTG 


rs3797600 


ACGTTGGATGCCTTCCTGTCACCTCCTTTG 


ACGTTGGATGGGAAGTGACTGCTGAGCTG 


rs3797602 


ACGTTGGATGAGAAACAGGGACTGGCTGTGT 


ACGTTGGATGAGCAGGCTCCGGGAAGTATG 


rs3797603 


ACGTTGGATGCACCCATCCATCATGATGTC 


ACGTTGGATGTGCTACCTCAAAACAGTGGG 


rs3776819 


ACGTTGGATGCAAGCACCATTCCATTGCAC 


ACGTTGGATGAAATGAGGATTGCAGTCCCC 


rs252076 


ACGTTGGATGACTTCTGACTTCAGGTGATC 


ACGTTGGATGTATAGGAACGAAAGAAAGCC 


rs252075 


ACGTTGGATGTGGGAGCATTTGCAGGCATG 


ACGTTGGATGAAGCCTCAGATGGTTCGGAG 


rs252074 


ACGTTGGATGTTGCGATGGCCTCCTGGCT 


ACGTT GG ATG AAGTTG AG G G CTCCG GAG C A 


rs252068 


ACGTTGGATGGGGTAGGAAGGGTTTAAGC 


ACGTTGGATGGCAGCCCCTCAATTCTTTAG 


rs252069 


ACGTTGGATGTGCCCATTTCCTGTTATTCC 


ACGTTGGATGTTTGGACTTGCCGTGCAACT 


rs1 94040 


ACGTTGGATGTGCCCATTTCCTGTTATTCC 


ACGTTGGATGTTTGGACTTGCCGTG CAACT 


rs252070 


ACGTTGGATGCTCAAGGACATTGTCCCTGG 


ACGTTGGATGGGAGAAGCAGCTCTCCTTTC 
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Forward 
PCR primer 


Reverse 
PCR primer 


rs3797606 


ACGTTGGATGGTTTCCCCAAACAAGAGAGC 


ACGTTGGATGGGAAATGTTCAAAGCCGCAG 


rs171667 


ACGTTGGATGGGGAAACACATTGTAATGCG 


ACGTTGGATGCCTTCCTCATTGTCTATTCC 


rs1 87539 


ACGTTGGATGAGCCACCCCAACCTTCAGGA 


ACGTTGGATGTTGCTCCTGGACATGG I I I I 


rs3836834 


ACGTTGGATGAAGAAACGTGACTCTTGCTC 


ACGTTGGATGTAGTAATTCTGATCCTGGCC 


rs252071 


ACGTTGGATGGCTTCAACCTGAAACAACCC 


ACGTTGGATGGGGATATTCCCACTCTGAG 


rs252072 


ACGTTGGATGTTGTTTCCCCAAAGGCGACG 


ACGTTGGATGTGTG I I I I CCAGAGCTGGAG 


rs252073 


ACGTTGGATGGGGAAAGGCCGAGAAAAGTC 


ACGTTGGATGACAAGCTCAGCAGAGTTCCA 


rs379589 


ACGTTGGATGAAACACGGGAGTACTGAGCA 


ACGTTGGATGTTGTTAGCTGTCTGTCCGTC 


rs2052472 


ACGTTGGATGAACCAGCTCAAGGATCACCC 


ACGTTGGATGAAAGGAGACGGTCAGCTGTC 


rs2052471 


ACGTTGGATGACAGCTGACCGTCTCCTTTG 


ACGTTGGATGCCCGTCCTGGACAAGCTTTT 


rs2052470 


ACGTTGGATGACAGCTGACCGTCTCCTTTG 


ACGTTGGATGCCCGTCCTGGACAAGCTTTT 


rs2052469 


ACGTTGGATGAGGGAAAGATATCGCACGCG 


ACGTTGGATGAGTGAACAACTGCTCGCCTC 


rs3797608 


ACGTTGGATGTGCTTTGCCTTGGCTTCTGC 


ACGTTGGATGTGCACTAAGGGAGTGAGTGG 


rs3797609 


ACGTTGGATGTGCAGAAGCCAAGGCAAAGC 


ACGTTGGATGACAGCATTTGGAGTCCCCTG 


rs3822601 


ACGTTGGATGAGGTCAGTGAGGCCTGAGAT 


ACGTTGGATGTGTCTGGCCTGAAGATCGAG 


rs153131 


ACGTTGGATGTAATCACGTGTCCTGATCCC 


ACGTTGGATGAGCTGTCCTCAGTCATGTTC 


rs751546 


ACGTTGGATGTCCTGCTCTGCCGTTCTACA 


ACGTTGGATGATCAGCTCAAAGGACCGGTG 


rs2279979 


ACGTTGGATGTATTGCTACCAGGAACACGTA 


ACGTTGGATGAAAAAGGGGCCACTTCAGGG 


rs252060 


ACGTTGG ATGTGGCCAG AG CCCGTGTTT C 


ACGTTGGATGCGGCCAATCCCATCTCTATG 


rs3797610 


ACGTTGGATGAAAAGCTTCTCCCTTGGGTG 


ACGTTGGATGCAAGTAGGGCAGAAACTCAG 


rs1 94039 


ACGTTGGATGAAAGTGCTGGGATTACAGGC 


ACGTTGGATGTGCTGGGAGAAGACATTCAC 


rs1 68773 


ACGTTGGATGTGGTGCACTGGAGATATCAC 


ACGTTGGATGGATCCCTATCCTACCTCTTC 


rs252061 


ACGTTGGATGTGTCACACTCCTCTTGTAAG 


ACGTTGGATGCTGTCCTTCCATGCTTTTGC 


rs1 87537 


ACGTTGGATGCGAGGATGTCATGCTAAGTG 


ACGTTGG ATG GTAC CTCG C AT AAGTG G ATC 


rs252062 


ACGTTGGATGAAGCACATTCATGTGGCTGG 


ACGTTGGATGCTGAAACTCAATGGGCACAG 


rs2431255 


ACGTTGGATGGGTGAAGACGGTGACTTATG 


ACGTTGGATGCTGGTGTCCTTGAAGAACTG 


rs3797612 


ACGTTGGATGAGTGAGGACGCAGGGCATTC 


ACGTTGGATGAGCGTGGGCGAGGGAGATAA 


rs3797613 


ACGTTGGATGATCAGAGGCAGAGACCCCCC 


ACGTTGGATGGGGTGTCCTGCAGAGGCGG 


rs614114 


ACGTTGGATGGGTTGGAGGATGTCTAGAAC 


ACGTTGGATGGGCTGGATCACTAGGGTTTG 


rs252063 


ACGTTGGATGTTGGAATTACAGTCCGATGG 


ACGTTGGATGCTGAGAGACTGAAAAGCACA 


rs252064 


ACGTTGGATGCTGAGAGACTGAAAAGCACA 


ACGTTGGATGTTGGAATTACAGTCCGATGG 


rs252065 


ACGTTGGATGAAAACTAAGGCTCAGAGGAC 


ACGTTGGATGTGGGCTTGGAATTACAGTCC 


rs450502 


ACGTTGGATGATGAGAAAACCAAGGCTCAG 


ACGTTGGATGCTGGGCTTGGAATTACAGTC 


rs439252 


AC GTTG G ATG ATCTCCTG ACCTCGT G ATC C 


ACGTTGGATGTCATAATAACGGCCGGGTGC 


rs252066 


ACGTTGGATGTTTCCTCTTGACCGGTCTTG 


ACGTTGGATGTAAAACGAATTCTGCCGATG 


rs457957 


ACGTTGG ATGTTC ACGTGC ATT AG AG CG AG 


ACGTTGGATGAATTCCCTCCCCAATTCCTC 


rs3797614 


ACGTTGG ATG ACTG CG AG CTTTAAG G AG G G 


ACGTTGGATGCCAAACAGAAGCCCCTTTTC 


rs423552 


ACGTTGGATGGCAGGACCTCGATGTTGTAG 


ACGTTGGATGATCCTAGAGGAGCACGCCAAC 


rs398829 


ACGTTGGATGTAGTCATCGTCCGCAGCATG 


ACGTTGGATGAAGACGGTGTCCTCTCCTTG 


rs4 16646 


ACGTTGGATGGCTGGGTCTCTCACAGTCTC 


ACGTTGGATGAGACAGGCACCTCTGTGACTT 


rs1 87450 


ACGTTGGATGAGAAGGCAGGGACGATATCC 


ACGTTGGATGACCAAGATGAACCCCTCTGT 


rs337807 


ACGTTGGATGTCACCCAGTGCTGACAGCAG 


ACGTTGGATGATGCTGGGATGCCATGGGTC 


rs337806 


ACGTTGGATGAATTAAGAGATGGGGCCACC 


ACGTTGGATGGCCCTGTGTGTTTTGTCTCC 


rs 1396438 


ACGTTGGATGTACCTTCTGGTGCCAGAATG 


ACGTTGGATGCCTGGAGACAAAACACACAG 


rs 1396437 


ACGTTGGATGTAAAAACTCTGCCTGCTCGG 


ACGTTGGATGTCCAGACATTCCCCGTAGGA 


rs2411811 


ACGTTGGATGGAGGGATGCTCTAGAACATA 


ACGTTGGATGCTGAATTTCACCTGAAATGG 


rs2898813 


ACGTTGGATGTCCTCACCCACTTTGCCTTT 


ACGTTGGATGATCGTGATAATTTTGGGGTG 
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dbSNP 
rs# 


Forward 
PCR primer 


PCR primer 


rsioy^oo 


AUo I 1 ooA 1 1 LUU 1 A 1 AoOAAtjtjO 1 O 1 A 


ACG 1 1 GGATG 1 1 AACCCAGGCCATGAAGAG 




a r/^TTrr ATrArrTrrAp ATrTPTTTrrTp 
AOu 1 1 bbA 1 bAut 1 ubAbA ILIUI 1 1 VjU \ L* 


ACG I 1 GGATGCTAAAACAGGATGGCTCTGG 


rs337805 


ACG 1 1 GGATGGAAACAAACCAAGGAGCAGG 


ACGTTG G ATG ATGTG G AC AAC GTTG G ACTC 


rs191415 


ACGTTGGATGAA 1 1 ACATGACTCGGACAAG 


ACGTTGGATGTGCTGGTGAAGTACAGAAGG 


rs1 80045 


ACGTTGGATGGTCCCAGG mm CTGTTCTC 


ACGTTGGATGTGTACTTCACCAGCACTGAG 


rs1 89255 


ACG I I GGATGAGG 1 1 GCAGACTCAGTCCCA 


ACGTTGGATGGGGTGA I I I GCGGGAATGAG 


rs652766 


ACG 1 1 GGATGGGGTGA 1 1 1 GCGGGAATGAG 


ACGTTGGATGACCATCCCACGATGCTCCC 


rs466750 


ACGTTGGATGTATCTCC 1 I AAATGCCTTGG 


ACGTTGGATGTGACCAGGAGGAGTTAAAAC 


rs442406 


ACGTTGGATGTGACAAGGTCAC GTGTTCTG 


ACGTTGGATGCCAGACAAGTCTGATACAGC 


rs662407 


ACG 1 1 GGATGCCACAGTCACCATTACTGAG 


ACGTTGGATGCTTGAGCCATGAGTGGAATG 


rs592971 


ACG 1 1 GGATGGGAAGCA 1 1 TC 1 1 1 GACTGC 


ACGTTGGATGATTCCATCTCATGGCTCAAG 


rs457187 


ACG 1 1 GGATGTGTGAGATGAGGAGTATCTG 


ACGTTGGATGGCAGTCAAAGAAATGCTTCC 


rs459490 


ACGTTGGATGACAGATACTCCTCATCTCAC 


ACGTTGGATGGGGAG I I 1 I GCTGTTATAGC 


rs459668 


ACG 1 1 GGATGGC I I CA 1 1 1 ACTGAGGTC 1 I C 


ACGTTGGATGTGAATGTTCAACGACTACAC 


rs462646 


ACGTTGGATGCAA MAM CGACGGAGATTA 


ACGTTGGATGCTCCTCCAAATGAATCAAGAA 




ACG I I GGATGATGCCTCCTCA 1 1 GTCATTC 


ACGTTGGATGCCCAACAAAGTGATTCCAAC 


rs463455 


ACGTTGGATGATGCCTCCTCA 1 I GTCATTC 


ACGTTGGATGCCCAACAAAGTGATTCCAAC 


rs675880 


ACG 1 1 GGATGCAGCTCCA 1 1 GATCTG M 1 C 


ACGTTGGATGAAGAATGACAATGAGGAGGC 


rs8 1 06 1 7 


ACGTTGGATGTGATCTCAGCTTACCACAGC 


ACGTTGGATGATGCCTGTAATCCCAGCTAC 


rs464156 


ACGTTGGATGCAGATCCAAGAATATGTGGG 


ACGTTGGATGTTCTAGAAAGGAGCCAAATC 


rs4 58083 


ACGTTGGATGTG M G M 1 CTTCCCCTCCTG 


ACGTTGGATGTGGCTCC 1 1 1 CTAGAATCCC 


rs467333 


ACG 1 I GGATGC MGMAlllCll CCCCTCC 


ACGTTGGATGTTGGCTCCTTTCTAGAATCC 


rs465381 


ACGTTGGATGACTTGCCCATCTG 1 1 1 CCAG 


ACGTTGGATGACAAGCCTCTAAGGATAGGG 


rs466363 


ACGTTGGATGAAGTGACCCTGAGGTGATGG 


ACGTTGGATGTGAAGACAGTTCACCCCGTG 


rs24 57099 


ACGTTGGATGTCTCCTTACACTGCCAGCGT 


ACGTTGGATGCACTGTATTGCTACTTGAGC 


rs463901 


ACG 1 I GGATGAGAGTGCCAAGTGCAAAAGG 


ACGTTGGATGTGTCTTGCGTCTGTGTATCC 


rs465621 


ACGTTGGATGGGAAGTCATGGAAGTGCTAG 


ACGTTGGATGAAAGAGCCCTAGGCTTGGAA 


rs463724 


ACGTTGGATGAGTGTGCCTGTCTGCCCTCA 


ACGTTG G ATG AAG G GC AG ATG GC AC ACTTG 


rs465242 


ACGTTGGATGAGTGTGCCTGTCTGCCCTCA 


ACGTTGGATGAAGGGCAGATGGCACACTTG 


rs4674 1 9 


ACGTTGGATGAGTCCCCAAAACGTAAGTCC 


ACGTTGGATGAGTCTAATTCCCTGAGCCTC 


rs456135 


ACGTTGGATGAGTCTAATTCCCTGAGCCTC 


ACGTTGGATGACGTAAGTCCTAATGACCGC 


rs464536 


ACGTTGGATGTGCTCCAGGC I 1 I GGTCCTC 


ACGTTGGATGAATTAGACTAAGGCCATGATG 


rs461898 


ACG I I GGATGGGGAATACACAGCCACAGAG 


ACGTTGGATGAGGTCAACGGGAACAAGGTC 


rs389558 


ACGTTGGATGGCAGTCCTGACAGTTCTCTA 


ACGTTGGATGl 1 1 1 1 CTCCCTGAAGCATGG 


rs466752 


ACGTTGGATGGGCC 1 1 CTCTCC 1 1 1 AGTGC 


ACGTTGGATGAGTCCTGACAGTTCTCTAAA 


rs455655 


ACGTTGGATGCTA I I I GCACCCCATATGGC 


ACGTTGGATGAACACACAGCATCAGGTTCC 


rs463435 


ACG I I GGATG I 1 CAGCCATAGCTGGA M I G 


ACGTTGGATGCTCTGCTGGGAAAATGTGAC 


rs21 74971 


ACGTTGGATGAACACAAC 1 I CCCCTTCGTC 


ACGTTGGATGTGAATCCTTGGAGGTGAGTG 


rs1 979979 


ACGTTGGATGTGGCTGTCAGCACCCCACTT 


ACGTTGGATGCCCAAAGGAAGGGAGAATTC 


rs4 11804 


ACGTTGGATGCAGATGACAGGCGGAAAATC 


ACGTTGGATGAGGCTTCCAGATGATGTCCA 


rs1 623885 


ACGTTGGATGAATCAGCTAGGAAGAGCCTG 


ACGTTGGATGTTCCTGACCCCTCTAGGTCAG 


rs1 643811 


ACGTTGGATGCAGGGCCCTGGTACTTTCAG 


ACGTTGG ATGCATGGTGGTGATTG CACCTG 


TS434430 


ACGTTGGATGTCCAGGAGTTCACTGTAGAG 


ACGTTGGATGCACATGCATACATTCATCAC 


rs 187538 


ACGTTGGATGACATGGGGCTTGGCAAAATG 


ACGTTGGATGCACCTGCTCAGAAGTAGCAT 


rs252067 


ACGTTGGATGAGAATTGCTGTGGTGTGAGG 


ACGTTGGATG M M 1 CTTGGGAGCTGTCGC 


rs459319 


ACGTTGGATGCCATCTCTCTGACCTAGACA 


ACGTTGGATGGCTCCAAGGAAAATTGGGAG 


rs467289 


ACGTTGGATGGGCCCTCTTGGCTTGTCTTT 


ACGTTGGATGAGGCAGTGTGCCCTCTCATC 



76 



PATENT 

Docket 524593008900 



rs# 


Forward 
PCR primer 


PCR primer 


rs462644 


ACGTTGGATGATGATGTGGGTGAGCCCTTG 


ACGTTGGATGTAACACTCAGCACGCACCAG 


rs458752 


ACG I I GGATGCACCCACATCATGTGCGCTT 


ACG 1 1 GGATGCCCTTCTCTACCCAGCAC 1 1 


rs708320 


ACGTTGGATGAAACCAGCCTGGCTAACATG 


ACGTTGGATGACAGGTGCCTGCTATCATAC 


rs457954 


ACG I I GGATGAACCAGACC I I GACTGATGG 


ACG 1 1 GGATGCCTCATACAAGTAGCCAAGG 


rs24 11810 


ACG I I GGATGGC I I AACCAGACCTTGACTG 


ACGTTGGATGAGTGTAAGGATATCCACGGC 


rs3084687 


ACGTTGGATGATCCCTTGAGCCAGAGATTC 


ACGTTGGATGATGTCCTGTGCACACACAAG 


rs69638 


ACGTTGGATGTGCTCATTGCTGTCCTCATC 


ACGTTGGATGAGAAGAAAGGTGTGCAGTGG 


rs455452 


ACGTTGGATGAGTGATGATGAGCCTGCTGG 


ACGTTG GATGTCAGGTTCCCTCTCTGTGTC 


rs464850 


ACGTTGGATGTCTCTCTGTGCTCCAGACCA 


ACGTTGGATGTGGGCTGAGATTTCTGTGGG 


rs431472 


ACGTTGGATGAACCAGTGTGGGTGTGAAGC 


ACGTTGGATGAGAGACTGCATCAGGCAGGA 


rs241 1809 


ACGTTGGATGAGCGCATAAGTGACCACCAG 


ACGTTGGATGGCACTCACAGGGCATTGATG 


rs2457094 


ACGTTGGATGTTACTGTCACCTTGGGTCTC 


ACGTTGGATGGGAAGTCTGTATAGACGCAG 


rs2457095 


ACGTTGGATGTTATCAAGGCCTGCGCAGTG 


ACGTTGGATGACTCCTGACCTCAGGCAATC 


rs2261740 


ACGTTGGATGATCGTGCCACTGCACTCCAG 


ACGTTGGATGTCATC 1 I 1 I GGTAGCCCCCC 


rs1 1 09180 


ACGTTGGATGCCAGGCCTGTATTGCACATC 


ACGTTGGATGAGAATGCGTGTGCATGTGGG 


rs1 1091 79 


ACGTTGGATGTGTAATGGTATGCAGACCCC 


ACGTTGGATGGAGTGCCGTATTTGTCCTTC 


rs1 109178 


ACGTTGGATGGCAAACAACAACAGCAACAG 


ACGTTGGATGAAGTGTGGATTTGTGCAGAC 


rs456909 


ACGTTGGATGTAGCTGCTTCATCTGTAAAG 


ACGTTGGATGGGCACTTTACCGATCTACTC 


rs469124 


ACGTTGGATGACTTGGACACACATAGGCTG 


ACGTTGGATGTGAAATGCTCAGGGTGTGTG 


rs468039 


ACGTTGGATGTGAAATGCTCAGGGTGTGTG 


ACGTTGGATGAGGACTTGGACACACATAGG 


rs467017 


ACGTTGGATGGTCTAGCTGCCACTAAACAG 


ACGTTGGATGATGTGCCAAGAGGCTTTGAG 


rs469290 


ACGTTGGATGTGCCC I I I GTGTGCTCAGAG 


ACGTTGGATGTCCCTCTGTGCTGTGTTTGG 


rs469090 


ACGTTGGATGACTTGTCTTCAGGTGCTTGG 


ACGTTGGATGGATGGTTAGTCTCCTGGTTC 


rs469568 


ACGTTGGATGAGCACCTCTGGC I I I CATTG 


ACGTTGGATGATTCACCAGGAAATCCCAAC 


rs468386 


ACGTTGGATGTAATCCCAGCCC I I I GGAAG 


ACGTTGGATGTATGGAGACAGGG I I I lACC 


rs469349 


ACGTTGGATGTTAGAGACAGAGTCTCACTC 


ACGTTGGATGTTGATCCCAGGAGTTCAAGG 


rs469099 


ACGTTGGATGTTGGAGCTGCTCTAGTTCTC 


ACGTTGGATGTGAAAACCGGGACTCAGCTC 


rs456868 


ACGTTGGATGACAGAGCAGGGAGCTGCGGT 


ACGTTGGATGATTCACCCCCAGCTACTGTG 


rs465389 


ACGTTGGATGAGGCTTTGTAGACAGCTCCC 


ACGTTGGATGTGCCAGTGCTCTGAGTATGC 


rs463892 


ACGTTGGATGAGGCTTTGTAGACAGCTCCC 


ACGTTG G ATGTGCCAGTGCTCTG AGTATGC 


rs468548 


ACGTTGGATGACTGGAAGGGAACATGCAAG 


ACGTTGGATGCCTGGATGCCCTTTATAGAC 


rs654612 


ACGTTGGATGACTGGAAGGGAACATGCAAG 


ACGTTGGATGTGGATGCCCTTTCTAGACAC 


rs468542 


ACGTTGGATGGCCTCCA I I I I CCTTCTCAC 


ACGTTGGATGTGTCTAGAAAGGGCATCCAG 


rs469262 


ACGTTGGATGTTCTGAGCTGAACGAAGCAG 


ACGTTGGATGGGTCAGGGATCCTTTGATGC 


rs708323 


ACGTTGGATGCACATACTATACAGGTCACC 


ACGTTGGATGGAGGGAGAAGATGTTGTGAA 


rs469089 


ACGTTGGATG I I I GGAAGTACCACCTCAGC 


ACGTTGGATGAATGGAAGGAAGGATCAGCC 


rs469396 


ACGTTGGATGAGTGACTCCAATGAGGGAAC 


ACGTTGGATGTCTCACACCACTGATCCTTC 


rs468723 


ACGTTGGATGTGTGGATCTTGCTGTTTGGG 


ACGTTGGATGTATTGGCATCGCGTATCAGG 


rs467604 


ACGTTGGATGACTCCTGCCATTAAACTCTC 


ACGTTGGATGCTTGGCTTAACTTACAAGGG 


rs338874 


ACG I 1 GGATGCCCCACCACAGCCACTGGG 


ACGTTGGATGAAGGGCCTTGCCCCACCCAA 


rs338875 


ACGTTGGATGTGCTGTCTTGCTCGCGTGTG 


ACGTTGGATGACACTGGATATGTCAGGGTC 


rs1 385803 


ACGTTGGATGTCACCACCATTCCAGAAGTG 


ACGTTGGATGACCTTCCTTATTGCTGTGGC 


rs 1385804 


ACGTTGGATGTCACCACCATTCCAGAAGTG 


ACGTTGGATGACCTTCCTTATTGCTGTGGC 


rs338876 


ACGTTGGATGTTAGGGCTGGGTGGAGGAAG 


ACGTTGGATGTCCAACTCCCAGTGACAGAG 


rs 189803 


ACGTTGGATGCCTCCAGTTTCTCTCTTCTG 


ACGTTGGATGATCCTGGATTAGCCAGATGG 


rs452215 


ACGTTGGATGTAGCTCTATTCTTCCACCCC 


ACGTTGGATGAG CG AGACTCCGTCTCAAAA 


rs641170 


ACGTTGGATGATAGCTCTATTCTTCCACCC 


ACGTTGGATGAGCGAGACTCCGTCTCAAAA 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rsoo*+oyo 


Abb I I bbA lull I b 1 UnuO 1 A 1 ftuftAAU 


APPTTPPATPPP^A^APTPPPTPTPAA A A A A A 
Abb 1 1 bbA 1 bbbAbAb 1 bbb 1 b 1 bAAAAAAA 


rsocsDoou 


Mbvb 1 1 bbA loll bOUL/LMML I A 1 lol bb 1 b 


ACG 1 1 GGATGGG I I I CCCAGACAGTG 1 1 1 G 


rs^zyooo 


ACG 1 1 GGATGTA 1 1 ATCTGCAGACACCTGG 


ACG 1 1 GGATGATCTCA 1 1 CCCACCCTC \ 1 C 


rs371229 


ACGTTGGATGTATTATCTGCAGACACCTGG 


AP^TTP,P^ATP4ATPTPATTPPPAPPPTPTTP 
Abb 1 1 bbA 1 bA 1 b 1 bA 1 1 bbbAbbb 1 b 1 1 b 


rs460874 


ACG 1 1 GGATGGTCCTGCGGCTAAAAA 1 I CC 


ACGTTGGATGGGGCAGGTCAACTAGAAAAC 


rs646121 


ACG 1 1 GGATGGGGCAGGTCAACTAGAAAAC 


ACGTTGGATGGTCCTGCGGCTAAAAA 1 1 CC 




A r^fATT("AfA AT^f^rrA A^AATTTA^AA A A A^TTArr 
n^o 1 I bbA 1 ouLOAoo 1 1 1 UbAAAu 1 1 Abb 


APPTTPPATPTPPPTTPPTPATPPPPTAAP 
Abb 1 1 bbA 1 b 1 bbb 1 1 bb 1 bA 1 bbbb 1 AAb 


rs467863 


ACG 1 1 GGATG I 1 I CGAAACCTGGCTGATGG 


ACGTTGGATGTGCCACTGTCAGAAGACAAG 


rs191434 


ACG I 1 GGATGCCAGCTGAAACACTAGACAG 


ACGTTGGATGAGCTGAAGAGGTC 1 1 1 CTCC 


rs2054782 


ACGTTGGATGAAAAAAGCAGGCCTCAGACC 


ACG 1 I GGATGTCTGACTCTCATCTGCAGAC 


rs468499 


ACG 1 1 GGATGCTCCAGGAGGGACACTACGT 


ACGTTGGATGTGGCCAGCTTCTCCTCGATG 


rs1 80287 


ACG I t GGATG I I GTCTGCAGAA 1 I ACCTAT 


ACG 1 1 GGATGGAAAAAGAAAAAAAAATCAG 


rs338877 


ACG 1 1 GGATGCGTGGATGGAAA 1 1 lACAl 1 


ACG 1 1 GGATG 1 1 C I I 1 GGATCAATG I 1 GCC 


rSDDUDO O 


Abb 1 1 bbA 1 bbbbA 1 b 1 1 AO 1 b 1 A 1 bA lb lb 


ACG 1 1 GGATGAAAGTGCTGGGA I I ATAGGC 


rs193419 


ACGTTGGATGGCAAATCCAAAGACACAGGG 


ACGTTGGATGATG Mil CATCACCCCAGTG 


rSl °f 88 


ACGTTGGATGTGTGACCTGGTAGCTTAGAG 


ACGTTGGATG 1 1 GTAGGAGGTCAGAAGAGG 


rs1 86834 


APrTTPrATrTA APPTAPPAPPT^APAPAP 
Abb 1 1 bbA 1 b 1 AAbb 1 AbbAbb 1 bAbAbAb 


ACGTTGGATGAGTTGATAGGAGAGTCAGGC 


rs i oyzou 


APPTTPP ATPTA APPTAPPAPPTPAPAPAP 
Abb 1 1 bbA 1 b 1 AAbb 1 AbbAbb 1 bAbAbAb 


ACGTTGGATGAGTTGATAGGAGAGTCAGGC 


rs1 89267 


ACGTTGGATGCCTCA 1 iGTGCCCTGl iGTG 


ACG I I GGATGCTCTGCCTGACTCTCCTATC 


rs1 70937 


ACGTTGGATGCCTATCAACTGTTGATGGCG 


ACGTTGGATG 1 1 CCTCATTGTGCCCTG 1 1 G 




APPiTTPP ATPTAPTPP APPPPTTTPPAP AP 
Abb 1 1 bbA 1 b 1 Ab 1 bbAbbbb 1 1 1 bbAbAb 


ACG 1 1 GGATGTGCCCATGCTCATGTG 1 1 GG 




ACGTTGGATGTGCCCATGCTCATGTG 1 1 GG 


ACG 1 1 GGATGACCCC 1 1 1 GCACAGATGCTG 


rs460454 


ACGTTGGATGAAGAAGGACCGTGTCAGAGA 


ACGTTGGATGACATGAGCATGGGCAGGTAC 


rs^DUnoo 


APPTTPPATPAPATPAPPATPPPPAPPTAP 

Abb 1 1 bbA 1 bAbA 1 bAbbA 1 bbbbAbb 1 Ab 


ACGTTGGATGAAGAAGGACCGTGTCAGAGA 


rS4DUt)UD 


A PPTTPPATP A PPPTPPAPAPPPTPTPTP A 

Abb 1 1 bbA 1 bAbbb 1 bbAbAbbb Iblbl oA 


ACGTTGGATGTGCTCTGAGGGCAGAACAAG 


rs931 316 


ACGTTGGATGATGCACACACCCATGGTCAG 


ACGTTGGATGCGGTTCACTCCAGCA I 1 I CC 


rs463431 


ACGTTGGATGTCACCACAGCCCATGGGGA 


ACGTTGGATG I 1 I GAAACTCACAATGTGGG 


rs461 542 


ACGTTGGATGTGATGAAGGCCAAGAATGCT 


ACGTTGGATGTGTGTCCAGAACGTCAGGTG 


rs463557 


ACGTTGGATGTGATGAAGGCCAAGAATGCT 


ACGTTGGATGTGTGTCCAGAACGTCAGGTG 


rs1 91453 


ACGTTGGATGCATCCAACAGCTCTGTCTGC 


ACG 1 1 GGATGACCCATCTGTAGCGCATCAG 


rs2271212 


ACGTTGGATGAGCTTCCCCGGAGGCAACGA 


ACGTTGGATGTGCAGGTCTCGGCCAAAGAC 


rs462009 


ACGTTGGATGCAGGCTCCTCCTCGTTGCC 


ACGTTGGATGTTGGTGTCCCACGTGGTGT 


rs2271211 


ACGTTGGATGTCGTACCCCTGCTCTGGACG 


ACGTTGGATGACTGACGCCCAGGGCCGCTT 


rs396474 


ACGTTGGATGTGGGAGTTGGAGATGATGAG 


ACGTTGGATGTTCCTCAGATCCCAGTCAAG 


rs428901 


ACGTTGGATGTCAGTGACAGAGCGAGACTC 


ACGTTGGATGGGGCTCGATAATGTAGCCAT 


rs452300 


ACGTTGGATGAGCACAAGCTGAAGAGGTCT 


ACGTTGGATGAGGAGAGAAGTGCACAGATC 


rs670256 


ACGTTGGATGTAGCTCTATTCTTCCACCCC 


ACGTTGGATGAGCGAGACTCCGTCTCAAAA 



TABLE 12 



dbSNP 




Term 


rs# 




Mix 


rs2278221 


CAAACGCTGAGGAGAAGCC 


ACT 


rs1 650358 


AAGAGACAAAAGGCCGGGC 


ACT 


rs1643818 


TACAGGTGTGAACCACCGC 


ACT 


rs3733916 


AGGCTGTAGTGTTGACAGAC 


ACG 
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dbSNP 


Extend 


Term 


rs# 


Primer 


Mix 


rs 1624933 


GTCTCAAACTCCTGACCTCA 


ACT 


rs1 624857 


AGACCAGCCTGGCCAACAT 


ACT 


rs1 624832 


GGCCAACATGGTGAAACCC 


ACG 


rs 1624829 


TGGCCAACATGGTGAAACCCT 


ACT 


rs2161171 


TG G AAT AAG AG CCCTGCAGTGG 


ACT 


rs1 530499 


CCCCTGCCCCAGCCACAGGAA 


ACT 


rs888764 


ATGTTGTATTGGCTATATTTGTCA 


ACG 


rs873987 


AAAACCTAAAAGAATCCACGGTA 


ACG 


rs4078699 


GACACATGATTAACAGCAAACAAT 


ACT 


rs87031 1 


AAGGGCGTGACGGCCCC 


ACT 


rs1643817 


GAAAGGGGAGAAAAGATTATCCC 


CGT 


rs1643816 


AGGACCAGGAGTTTCCCATTTT 


ACT 


rs1 650355 


GAATCAATGAAGAAGAGAGCTT 


ACT 


rs888763 


GGTCAGGAGGCAGAGGGA 


ACT 


rs1862212 


GGGGTGAAAGGGAGCAGGG 


CGT 


rs1 110514 


CAGGCCCCAGGTGAGGAA 


CGT 


rs3797600 


CTTTGTTGGTTAACCAAACCC 


ACG 


rs3797602 


GCTGACAGCTCCGGACATG 


ACT 


rs3797603 


TGTCATTCTCCTTGTGAACCCTC 


ACT 


rs3776819 


CCATTCCATTGCACCTGCATG 


ACT 


rs252076 


CAAAGTGCTGGGATTGCAGG 


ACG 


rs252075 


GAGCATTTGCAGGCATGCCCTCT 


ACT 


rs252074 


CTGGGTGGCTGCTGGGC 


ACG 


rs252068 


GGAAGGGTTTAAGCAAGGAG 


ACT 


rs252069 


TGAGCACCTACTATGGGCTAG 


ACT 


rs1 94040 


ATTCCATATCTTCAAAGTGATTCA 


ACG 


rs252070 


CCTGGGCTTCCCCTCCC 


ACG 


rs3797606 


AGCCCTTGGCCTCTCTCC 


ACT 


rs1 71667 


CGCCTTTTGCTTATGCAAAGA 


ACG 


rs1 87539 


AACCTTCAGGAAAGTTCCCAT 


ACT 


rs3836834 


TCAAAATATCAAACTACCATGAAA 


ACG 


rs252071 


ACCCTGAGACACAGGGACT 


ACT 


rs252072 


GCTGGGTCACACTCGCGGA 


ACG 


rs252073 


GCCGAGAAAAGTCAGGGATTCT 


ACT 


rs379589 


CGGGAGTACTGAGCACCCAGG 


CGT 


rs2052472 


CCCCACTGTGACTATCTCCAC 


ACT 


rs2052471 


GTCTCCTTTGGCTGCCAAG 


ACT 


rs2052470 


TGCCAAGGCCCTGTCCTC 


ACT 


rs2052469 


CGCGGGGAAGTACTCGGC 


ACT 


rs3797608 


GTCCTCCTGTTCTGAGGCCC 


ACT 


rs3797609 


GGCAGAGCGGATGGCCTG 


ACG 


rs3822601 


GTGAGGCCTGAGATGAGAACC 


ACG 


rs153131 


TCCCCATACTCCTGTGCTC 


ACG 


rs751546 


CCGTTCTACAGCGGTTAAGA 


ACT 



79 



PATENT 

Docket 524593008900 



dbSNP 
rs# 


Extend 
Primer 


Term 
Mix 


rs2279979 


GGCCACCAGACAGATGTAAG 


ACT 


rs252060 


CGTGTTTCGGCAGAGGTGA 


ACT 


rs3797610 


CTTCTCCCTTGGGTGATGTGTT 


ACT 


rs1 94039 


CCACCGIGCCGGGACAl I i 1 I 1 I I 


ACT 


rs1 68773 


ACTGGAGATATCACGGGAGC 


CGT 


rs252061 


CCAGCTGGTCACAGGGCTCCC 


ACG 


rs1 87537 


TCATGCTAAGTGAAATAAGCCA 


ACT 


rs252062 


GCCACCACCGTCCACAGA 


CGT 


rs2431255 


CTGTATATTTCACCGCAATTAAAA 


ACT 


rs3797612 


GGCATTCATCGTCAGGGCAA 


ACG 


rs3797613 


CCGCCGCCGGTCTCCCA 


ACG 


rs614114 


GAACGTTCTCTCACTTTTGCC 


ACT 


rs252063 


CTCCCGTCCTCTGAGCCTT 


ACT 


rs252064 


GAAAACTAAGGCTCAGAGGAC 


ACT 


rs252065 


TGGAAAAGGCGAGGCCTGGAGT 


ACG 


rs450502 


GGAAAAGGCGAGGCCTGGAGTT 


ACT 


rs439252 


GCCTCCCAAAGTGCTGGGATTA 


ACG 


rs252066 


TAGCCCTCTGGAGCCCAG 


ACG 


rs457957 


GGGCCCCTCCTTAAAGCTC 


ACT 


rs3797614 


TGGCCCTCGCTCTAATGCA 


ACG 


rs423552 


CTCGATGTTGTAGTCATCGTC 


ACG 


rs398829 


TGGCGTGCTCCTCTAGGA 


ACG 


rs4 16646 


CTCAGCAGGTCTGATCCATC 


ACT 


rs1 87450 


GGGCAGACTCCCCAGGAT 


ACT 


rs337807 


GCAGGCCACTCGGTGGAC 


ACT 


rs337806 


CCACCCCAGGGGTAGCCC 


ACT 


rs 1396438 


GGCAGGCAGGTGGCCTG 


ACT 


rs1 396437 


CGGCAGAAGCAGCCTCAAGA 


ACG 


rs2411811 


ACATAATTTCCAAATTTCACCCC 


CGT 


rs2898813 


GGTCCTGGGTGGAGGGAT 


ACT 


rs1 89256 


AGCAAGGCTCTATTTGGGGA 


ACT 


rs 173072 


CTTTGCTCACATCGTGGCCAAA 


ACT 


rs337805 


G GAG CAGGAAAATT AC ATG ACT 


ACG 


rs191415 


GAGTCCAACGTTGTCCACAT 


ACT 


rs1 80045 


ACTTGTTTCTACAATTCTCATTC 


ACG 


rs1 89255 


GACTCAGTCCCAGGTTTTCT 


ACT 


rs652766 


AGAAAACCTGGGACTGAGTCT 


ACT 


rs466750 


TCCTTAAATGCCTTGGTTGGCAAT 


ACT 


rs442406 


TTCTGGCTGTTG G GTTTG AAC 


ACT 


rs662407 


AAAC ATCTG AAATT AAAAG C ACC 


ACT 


rs592971 


AGCATTTCTTTGACTGCTCTTTCA 


ACT 


rs457187 


GGAGTATCTGTTCCTTGTGG 


ACT 


rs459490 


TTGAACATAGGAATAACCCGC 


ACT 


rs459668 


GTCTTCTTTTGTG GGAGA 


ACG 
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ATTATTCGACGGAGATTATTTGAC 



ATTA I I I 1 1 CTGTCTGGTGTGG 



CCTCCTCATTGTCATTC I I I I IC 



CTTTCATGACATTGACACAACTAC 



CCACAGCCTCCGCCTCCC 



GGGTTTCCAGGTTAAAATGGC 



CTCCTGCTCTGCCTATCCTT 



ATTTCTTCCCCTCCTGCTCT 



GCCTCCCACAGTTCCCTTGTT 



GTGATGGCTCTGCACCAGA 



AGCGTGTGCCAGCTCTCC 



GACACAATTCAGAGCGACTTAC 



AAGTGCTAGAAGAAAATGTAGC 



CCTTGCGCCATCCCCTAG 



TGTGCCCATCCCCCCCTT 



AACGTAAGTCCTAATGACCGCCC 



CCCTCTCCTCTTCTGGGCA 



GCTTTGGTCCTCCTGAGCC 



CAC AGA GCGACTC TCTCTTGGTT 



CTGACAGTTCTCTAAACTCCCA 



TTCC I I I I I CTCCCTGAAGCA 



CACCCCATATGGCTCATGGG 



GGGAAGGAGGTACTTAGCAG 



GTGCCACTCTCCAGCGGCC 



TTGCCGGCCCCCACCTC 



GAAAATCCCTGTCACCAGTC 



CTTGGCTGCAGCACCCCA 



GCCCTGGTACTTTCAGCTCCCT 



GTGTGCATGTGTGTGCCTG 



TAAACGGGCCAAAAACGCCTAT 



GCGCCTACGGATGTCAGG 



CATGTTGAACAGAGAGAAACGGTC 



TCACTGAGAAATATTTTGCTCCC 



GGTGAGCCCTTGGCTGTG 



T AAAG CG CTCTTAC AAATC AAC A 



TGGTGAAATCCTGTCTCTACTAAA 



CACCGTTTCTTATAATGCAGCC 



GGGGACGTTACTTCTTTTCAC 



ATTTATATATGTGTGTGTACACAT 



CCCATTGGCTGTCCTGGAA 



CCTCAACCCCAGATGCCCTC 



ACTCCTGCCTGAGTGTCTC 



GTGAAGCGGAAGGAGACTC 



CTGCACACCCTCTGCACAG 
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TGGCTGGCACCACTGCACTGC 



TGGCTCATGCTTCTAATCCCA 



CTGC ACTCCAGC CTGG GC 



ACATCAGTGACAGTGTAATGGTA 



TATGCAGACCCCCTCCCC 



AACAACAGCAACAGAAATGAAG 



CGATTCCCACGCGTGTCTG 



CCTGGCTCCATTGGTGTGAA 



CCTTCACACCAATGGAGCCAG 



CTGCCACTAAACAGATGAGAA 



ATTTCTGGGCCCAAAGTCCA 



CCAATTGTTCCAGCCACTCCC 



TGATATTGCTTGCTTGGGTCTTAG 



GGTCAAGAATTCAAGAGCAGC 



GTGCAGTGGCACGATCCTA 



GCAGGTGGAACCGCAGAC 



GGAGCTGCGGTGACTCCC 



CCCTGGCACTCGCAGACC 



AGCTCCCCCCGCACCAC 



AAGGGAACATGCAAGCAAAGACTC 



TGCAAGCAAAGACTCGAATGA 



TCACTCACTTGATTCCTGCCATC 



CACTGTGGGATTTCCAGCAGA 



TATACAGGTCACCCATTTAAAGT 



CCTCGGCCTTCCCCAGCT 



AATGAGGGAACCTGCAGTTTAAGA 



CAGACCCCATGCCTTGCC 



GAGTTTCCTCCTCTTTCACAA 



CACAGCCACTGGGGAGTAG 



TTGCTCGCGTGTGCCAGCAAAT 



AAGTGGAATTCTCATGGCAGAT 



CATTCCAGAAGTGGAATTCTCATG 



AGGAAGGTGCTCCGGCCT 



TGCTTCCCCCTTCCCCCT 



TCTATTCTTCCACCCCCATCTTT 



CTATTCTTCCACCCCCATCT 



CTCTTATATAGCTCTATTCTTCC 



AGGTGTCTGCAGATAATACATT 



CCTGGGGCACAGGACAATA 



GACAATAGTTGGGGCAAGAC 



ACAAAACTATCCTTCAAAAATACA 



G I I I I I GTTTCTCTGAAAGTGTCT 



CACCCAACTACTTGCTCCC 



GCTGATGGGAGGCCAATGT 
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dbSNP 


Extend 


Term 


rs# 


Primer 


Mix 


rs1 91434 


GTCCAGAGATCCTGCTCACT 


CGT 


rs2054782 


CCCCCTCCATCACCTCCC 


ACG 


rs468499 


GTGAGCCAGCAATTCTCCTA 


ACT 


rs1 80287 


CAATGATCAGAACTCAGAGGTTTT 


ACT 


rs338877 


AGAGATAAATTTCCAGTGTGAG 


CGT 


rs650665 


AGACATCCCGGCCGGGC 


ACT 


rs193419 


CCAAAGACACAGGGAGTAGATTA 


ACT 


rs1 80288 


GAGAATATTCTTGTGGGCTTAAT 


ACT 


rs 186834 


CCAGGTCACACACACACTC 


ACG 


rs1 89266 


CACACACTCCCTCTCACTGT 


ACT 


rs 189267 


TTCTGTGCATCTTTGACGCCATC 


CGT 


rs 170937 


GATGGCGTCAAAGATGCACA 


ACT 


rs463263 


CCCCTTTGCACAGATGCTG 


ACT 


rs463262 


GGGGAGCAGCCAGTTCCTA 


ACT 


rs460454 


AGAGGCTGGGGACAGAGAA 


ACT 


rs460455 


GGTACCCACCAGTCTCCTTCT 


ACT 


rs460505 


CAGCGTCTCTGACACGGTC 


ACG 


rs931316 


G GTC AG AG C AG AC AC ATC C AC AT 


ACG 


rs463431 


C C CAT GGGGAGCAC C AAG 


ACT 


rs461542 


TGGGAGCTCCCGGGATATTGCC 


ACG 


rs463557 


GCTCCCGGGATATTGCCCA 


ACT 


rs191453 


CTGGGCTGGGGCCCTGC 


ACT 


rs2271212 


CG AGG AGG AGCCTG GC AG 


ACG 


rs462009 


CTCCTCGTTGCCTCCGGG 


ACT 


rs2271211 


GACGTAGCTGCCGACACCA 


ACG 


rs396474 


CTGGTGGCCCATCTATCCTGG 


ACT 


rs428901 


GAGCGAGACTCCGTCTCAA 


CGT 


rs452300 


CTGAAGAGGTCTTTCTCCTTCC 


CGT 


rs670256 


TTCTTCCACCCCCATCTTTG 


ACT 



Genetic Analysis 

[0237] Allelotyping results from the discovery cohort are shown for cases and controls in Table 13. 
The allele frequency for the A2 allele is noted in the fifth and sixth columns for osteoarthritis case pools 
and control pools, respectively, where "AF" is allele frequency. The allele frequency for the Al allele 
can be easily calculated by subtracting the A2 allele frequency from 1 (Al AF = 1-A2 AF). For example, 
the SNP rs2278221 has the following case and control allele frequencies: case Al (C) = 0.36; case A2 
(T) = 0.64; control Al (C) = 0.37; and control A2 (T) = 0.63, where the nucleotide is provided in 
paranthesis. Some SNPs are labeled "untyped" because of failed assays. 
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TABLE 13 



dbSNP 
rs# 


SEQ ID NO: 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs2278221 


210 


178695460 


C/T 


064" 


6~63 


0 770 


rs1 650358 




178698858 


C/G 








rs1643818 


3609 


178698859 


C/G 








rs3733916 




178699568 


C/T 








rs1 624933 


5593 


178700843 




0~69 


0~71 


0 255 — 


rs 1624857 


5629 


178700879 


— cTT — 


0^79 


(Tsi 


0 574 


rs 1624832 


5639 


178700889 


A/G 


0^41 


(T44 


0 203 — 


rs1 624829 




1 78700890 




0^89 

! 


— oi)3 — 

'■ 


0 044 — 

'■ 


rs2161171 


8943 


178704193 


a7c~ 








rs1 530499 


17968 


178713218 


A/G 


039 

: 


039 

: 


— 0~861 

■ : 


rs888764 


19887 


1 78715137 


A/G 








rs873987 


21034 


178716284 


A/G 








rs4078699 


21085 


178716335 


— c7f — 


056 


0 54 


— 0 374 — 


rs87031 1 


21596 


178716846 


AJG 


0^51 


050 


— 0 590 — 


rs1643817 


23379 


178718629 


— a/c — 


0^27 


NA 


NA 


rs l 6438 16 




178718682 


A/C 








rs1 650355 


24007 


178719257 


A/C 








rs888763 


26121 


1 78721371 




040 


042 


— 0 390 — 


rs1862212 


26273 


178721523 


A?f 


(T55 


(T54 


— 0 753 — 


rs1 1 10514 


26755 


1 78722005 


Aff 


(T29 


0^28 


— 0 572 — 


rs3797600 


27411 


178722661 


C/T 


— oisei — 


0^57 


— 0 738 — 


rs3797602 


27710 


1 78722960 




0^65 

: ! 


0 JL g4 


— 0 564 — 


rs3797603 




1 78723092 


— cTT — 








rs3776819 


28379 


178723629 


C/T 


0.46 


0.46 


0.850 


rs252076 


29603 


178724853 


C/T 


0.46 


0.48 


0.519 


rs252075 


31232 


178726482 


C/G 


0.35 


0.36 


0.859 


rs252074 


31504 


178726754 


A/G 


0.35 


0.34 


0.816 


rs252068 


32583 


178727833 


C/G 


0.47 


0.48 


0.656 


rs252069 


32794 


1 78728044 


A/G 


0.28 


0.27 


0.626 


rs 194040 


32840 


178728090 


C/T 


0.31 


0.32 


0.665 


rs252070 


33044 


1 78728294 


C/T 


0.58 


0.57 


— 0-573 — 


rs3797606 


33150 


1 78728400 


A/C 


0.88 


0.88 


0.684 


rs171667 


33218 


178728468 


A/G 


0.48 


0.51 


0.166 


rs1 87539 


33513 


1 78728763 


C/T 


0.33 


0.34 


0.652 


rSoooDoo4 


33959 


\ lot zyzuy 


AACTAC 
CATGAA 








rs252071 


34486 


178729736 


A/G 


030 


b~3i 


— 0 666 — 


rs252072 


36289 


178731539 


— cTf — 


0^49 

'■ 


0^50 

: 


— 0 677 — 
'■ 


rs252073 




178731820 


C/T 








rs379589 


38247 


1 78733497 




059 


Q-gg 


— 0 096 — 


rs2052472 


38477 


178733727 


A/C 


0.05 


0.06 


0.508 


rs2052471 


38518 


178733768 


C/T 


0.89 


0.88 


0.459 


rs2052470 


38529 


178733779 


C/T 


0.83 


0.80 


0.125 


rs2052469 


38667 


178733917 


A/G 


0.83 


0.80 


0.172 


rs3797608 


39781 


1 78735031 


C/T 


0.06 


0.07 


0.578 


rs3797609 


39856 


178735106 


C/T 


0.05 


0.05 


0.812 


rs3822601 


39927 


178735177 


C/T 


0.08 


0.08 


0.802 


rs153131 


40506 


1 78735756 


A/G 


0.76 


0.77 


0.944 


rs751546 


41869 


178737119 


C/G 


0.93 


0.92 


0.585 


rs2279979 


42452 


178737702 


C/T 


0.93 


0.92 


0.436 


rs252060 


44788 


1 78740038 


C/T 


0.81 


0.82 


0.760 


rs3797610 


46059 


178741309 


A/C 


0.17 


0.17 


0.858 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs 194039 


46846 


178742096 


A/G 


0.41 


0.47 


0.035 


rs1 68773 


47712 


178742962 


A/T 


0.35 


0.38 


0.266 


rs252061 


48796 


178744046 


err 


0.21 


0.19 


0.508 


rs 187537 


49441 


178744691 


C/G 








rs252062 


49602 


178744852 


A/T 


0.95 


0.95 


0.960 


rs2431255 


49723 


178744973 


A/C 


0.24 


0.19 


0.034 


rs3797612 


50050 


178745300 


err 


0.38 


0.43 


0.036 


rs3797613 


50171 


178745421 


err 


0.21 


0.21 


0.941 


rs614114 


50477 


178745727 


err 


0.50 


0.53 


0.387 


rs252063 


50818 


178746068 


err 


0.57 


0.55 


0.313 


rs252064 


50833 


178746083 


err 


0.52 


0.52 


0.806 


rs252065 


50881 


178746131 


A/G 


0.22 


0.22 


0.857 


rs450502 


50882 


178746132 


A/G 








rs4 39252 


51386 


178746636 


err 








rs252066 


51534 


178746784 


C/T 


0.19 


0.18 


0.618 


rs457957 


52317 


178747567 


A/G 


0.67 


0.70 


0.172 


rs3797614 


52368 


178747618 


C/T 








rs423552 


52970 


178748220 


A/G 


0.90 


0.92 


0.215 


rs398829 


53023 


178748273 


A/G 








rs4 16646 


53356 


178748606 


A/G 


0.56 


0.57 


0.650 


rs 187450 


53882 


178749132 


G/T 








rs337807 


54553 


178749803 


CfT 


0.55 


0.59 


0.208 


rs337806 


55475 


178750725 


A/C 


0.11 


0.10 


0.925 


rs1 396438 


55530 


178750780 


A/G 


0.56 


0.54 


0.494 


rs 1396437 


55691 


178750941 


crr 








rs2411811 


55848 


178751098 


A/C 








rs2898813 


55879 


178751129 


C/G 








rs 189256 


56316 


178751566 


A/G 


0.19 


0.19 


0.988 


rs1 73072 


56911 


178752161 


A/C 








rs337805 


57320 


178752570 


A/G 


0.25 


0.24 


0.657 


rs191415 


57391 


178752641 


C/T 








rs1 80045 


57437 


178752687 


C/T 


0.51 


0.47 


0.211 


rs 189255 


57478 


178752728 


C/G 


0.15 


0.12 


0.273 


rs652766 


57500 


178752750 


C/T 


0.57 


0.61 


0.213 


rs466750 


59111 


178754361 


GAT 


0.35 


0.33 


0.493 


rs442406 


59333 


178754583 


A/G 


0.57 


0.59 


0.420 


rs662407 


59715 


178754965 


A/G 


0.31 


0.27 


0.102 


rs592971 


59804 


178755054 


A/G 








rs457187 


59851 


178755101 


A/G 


0.23 


0.24 


0.842 


rs459490 


59929 


178755179 


C/T 


0.21 


0.20 


0.604 


rs459668 


60052 


178755302 


C/T 


0.20 


0.19 


0.648 


rs462646 


60240 


178755490 


C/T 


0.43 


0.43 


0.905 


rs458272 


60359 


178755609 


C/T 


0.22 


0.20 


0.523 


rs463455 


60381 


178755631 


A/G 


0.25 


0.24 


0.644 


rs675880 


60456 


178755706 


err 


0.63 


0.65 


0.591 


rs810617 


60724 


178755974 


C/G 








rs464156 


60875 


178756125 


C/T 


0.34 


0.34 


0.892 


rs458083 


60968 


178756218 


A/G 


0.80 


0.82 


0.499 


rs467333 


60978 


178756228 


C/G 


0.11 


0.12 


0.369 


rs465381 


60998 


178756248 


C/T 








rs466363 


61557 


178756807 


C/T 


0.31 


0.34 


0.358 


rs2457099 


62091 


178757341 


C/T 


0.44 


0.44 


0.956 


rs463901 


62645 


178757895 


C/T 


0.43 


0.45 


0.395 


rs465621 


62943 


178758193 


A/C 


0.62 


0.63 


0.534 


rs463724 


63131 


178758381 


A/T 


0.09 


0.08 


0.523 


rs465242 


63145 


178758395 


G/T 








rs467419 


63406 


178758656 


A/G 


0.65 


0.66 


0.647 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs456135 


63427 


178758677 


C/G 


0.79 


0.80 


0.686 


rs464536 


63554 


178758804 


C/T 


0.36 


0.34 


0.296 


rs461898 


63661 


178758911 


A/G 


0.30 


0.32 


0.411 


rs389558 


64093 


178759343 


A/G 


0.24 


0.26 


0.325 


rs466752 


64153 


178759403 


C/T 


0.35 


0.37 


0.446 


rs455655 


64409 


178759659 


C/G 


0.87 


0.89 


0.536 


rs463435 


64544 


178759794 


C/T 


0.68 


0.66 


0.428 


rs21 74971 


65257 


178760507 


C/T 


0.52 


0.51 


0.695 


rs 1979979 


65626 


178760876 


A/G 


0.07 


0.06 


0.692 


rs411804 


65739 


178760989 


A/G 


0.78 


0.78 


0.976 


rs 1623885 


66392 


178761642 


C/T 


0.82 


0.80 


0.492 


rs1 643811 


66720 


178761970 


C/T 


0.24 


0.24 


0.924 


rs434430 


69177 


178764427 


A/T 








rs 187538 


69336 


178764586 


G/T 








rs252067 


69636 


1 78764886 


A/G 


0.21 


0.23 


0.606 


rs459319 


69823 


178765073 


A/G 


0.19 


0.20 


0.640 


rs467289 


69928 


178765178 


C/T 


0.26 


0.26 


0.988 


rs462644 


70547 


178765797 


C/T 


0.59 


0.58 


0.914 


rs458752 


70633 


178765883 


C/T 


0.18 


0.20 


0.513 


rs708320 


71805 


178767055 


A/C 








rs457954 


72181 


178767431 


C/G 


0.71 


0.73 


0.327 


rs24 11810 


72200 


178767450 


C/T 


0.28 


0.26 


0.252 


rs3084687 


72474 


178767724 


-/AT 


0.13 


0.12 


0.884 


rs69638 


72567 


178767817 


C/G 


0.54 


0.52 


0.449 


rs455452 


72973 


178768223 


A/G 


0.59 


0.60 


0.733 


rs464850 


73468 


178768718 


A/G 


0.11 


0.09 


0.249 


rs431472 


73889 


178769139 


A/G 


0.33 


0.34 


0.713 


rs2411809 


75730 


178770980 


C/T 








rs2457094 


75970 


178771220 


A/G 


0.71 


0.73 


0.383 


rs2457095 


76114 


178771364 


A/G 


0.74 


0.76 


0.551 


rs2261740 


76342 


178771592 


C/T 


0.35 


L 0.36 


0.702 


rs1 109180 


76449 


178771699 


A/G 








rs1 109179 


76465 


178771715 


C/T 








rs1 109178 


76791 


178772041 


A/C 


0.46 


0.45 


0.820 


rs4 56909 


78042 


178773292 


A/G 


0.55 


0.53 


0.444 


rs4691 24 


80758 


178776008 


A/G 








rs468039 


80778 


178776028 


C/T 








rs467017 


81356 


178776606 


A/C 


0.33 


0.32 


0.665 


rs469290 


81576 


178776826 


A/G 


0.57 


0.57 


0.871 


rs469090 


81689 


178776939 


C/T 


0.82 


0.83 


0.387 


rs469568 


81759 


178777009 


G/T 


0.38 


0.38 


0.888 


rs468386 


81950 


178777200 


C/G 








rs469349 


82562 


178777812 


A/C 








rs469099 


83591 


178778841 


C/T 


0.66 


0.63 


0.264 


rs456868 


83700 


178778950 


A/G 








rs465389 


83821 


178779071 


C/G 








rs463892 


83842 


178779092 


C/G 








rs468548 


83923 


178779173 


G/T 








rs654612 


83929 


178779179 


A/C 








rs468542 


84021 


178779271 


C/G 








rs469262 


84175 


178779425 


C/T 


0.45 


0.47 


0.405 


rs708323 


84417 


178779667 


A/G 


0.73 


0.69 


0.138 


rs469089 


84747 


178779997 


C/G 








rs469396 


85746 


178780996 


C/G 


0.38 


0.37 


0.817 


rs468723 


86129 


178781379 


C/T 


0.37 


0.38 


0.754 


rs467604 


86335 


178781585 


A/G 


0.34 


0.32 


0.504 


rs338874 


87315 


178782565 


C/G 


0.43 


0.44 


0.879 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs338875 


87648 


178782898 


A/G 


0.48 


0.50 


0.289 


rs1 385803 


87764 


178783014 


A/C 








rs 1385804 


87770 


178783020 


C/G 








rs338876 


88221 


178783471 


C/T 


0.39 


0.39 


0.889 


rs1 89803 


90474 


178785724 


A/C 








rs452215 


91148 


178786398 


G/T 








rs641170 


91150 


178786400 


G/T 








rs584398 


91160 


178786410 


G/T 








rs385330 


91733 


178786983 


C/T 








rs429538 


91772 


178787022 


A/C 








rs371229 


91785 


178787035 


C/T 








rs460874 


93140 


178788390 


A/T 


0.74 


0.71 


0.351 


rs646121 


93148 


178788398 


A/T 


0.93 


0.94 


0.687 


rs468262 


96080 


178791330 


A/G 








rs467863 


96157 


178791407 


C/G 








rs1 91434 


96313 


178791563 


A/C 








rs2054782 


96759 


178792009 


C/T 


0.44 


0.42 


0.353 


rs468499 


97026 


178792276 


A/C 








rs1 80287 


97320 


178792570 


C/G 








rs338877 


97732 


178792982 


A/T 


0.04 


0.04 


0.863 


rs650665 


98713 


178793963 


C/G 








rs193419 


99707 


178794957 


A/C 








rs1 80288 


99959 


178795209 


C/G 








rs1 86834 


100009 


178795259 


A/G 








rs 189266 


100020 


178795270 


C/G 








rs1 89267 


100065 


178795315 


A/C 








rs1 70937 


100086 


178795336 


C/G 








rs463263 


101270 


178796520 


C/G 








rs463262 


101276 


178796526 


G/T 








rs460454 


101371 


178796621 


C/T 








rs460455 


101376 


178796626 


C/G 








rs460505 


101439 


178796689 


C/T 








rs931316 


101820 


178797070 


C/T 








rs463431 


102392 


178797642 


C/G 








rs461542 


102602 


178797852 


A/G 








rs463557 


102604 


178797854 


A/C 








rs1 91453 


102896 


178798146 


C/T 


0.11 


0.14 


0.123 


rs2271212 


189104 


178884354 


C/T 


0.65 


0.57 


0.003 


rs462009 


189134 


178884384 


C/T 








rs2271211 


189205 


178884455 


A/G 








rs396474 


Not mapped 


Not mapped 


A/C 








rs428901 


Not mapped 


Not mapped 


A/T 


0.64 


0.72 


0.015 


rs452300 


Not mapped 


Not mapped 


G/T 








rs670256 


Not mapped 


Not mapped 


G/T 









[0238] The ADAMTS2 proximal SNPs were also allelotyped in the replication cohorts using the 
methods described herein and the primers provided in Tables 1 1 and 12. The replication allelotyping 
results for replication cohort #1 and replication cohort #2 are provided in Tables 14 and 15, respectively. 
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TABLE 14 



dbSNP 
rs# 


SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs2278221 


210 


178695460 


C/T 


0.64 


0.62 


— 0 624 — 


rs1 650358 


3608 


178698858 


C/G 








rs1643818 


3609 


1 78698859 










rs3733916 


4318 


1 78699568 


C/T 








rs 1624933 




178700843 




b~65 


069" 


— 0 322 — 


rs 1624857 


5629 


178700879 


— cTf — 


— oiri — 


tv — 


— Fja — 


rs 1624832 


5639 


1 78700889 


A/G 


fT^ft 


U o 4? 


— (Toec — 


rs1 624829 


5640 


178700890 


— cTT — 


087 

: 


tv — H — 

IY£ 


NA 


rs21 61 1 71 


8943 


178704193 


A/C 









rs1 530499 


17968 


178713218 


A/G 


039 

, : 


b~40 

'■ 


(V7RS 

: 


rs888764 


19887 


178715137 


A/G 








rs873987 


21034 


178716284 


A/G 








rs4078699 


21085 


178716335 


— cTf — 


055 


054 


0 733 — 


rs87031 1 


21596 


178716846 


A/G 


0^50 


(T50 


0 828 — 


rs 16438 17 


23379 


178718629 


A/C 


0^27 

: 


tv — H 

— ""type 




rs1643816 


23432 


178718682 


A/C 








rs1 650355 


24007 


178719257 


A/C 








rs888763 


26121 


178721371 




040 


040 


0 816 — 


rs1862212 


26273 


178721523 


A/T 


0 55 


0.55 


0.936 


rs1 1 10514 


26755 


178722005 


A/T 


0.29 


0.29 




rs3797600 


27411 


178722661 


C/T 






— 0 604 — 


rs3797602 




178722960 




064~ 

: 


0^63 

' 


— 0 879 — 

! 


rs3797603 


27842 


178723092 


C/T 








rs3776819 


28379 


178723629 


C/T 


0.47 


0.46 


0.889 


rs252076 


29603 


178724853 


C/T 


0.46 


0.49 


0.410 


rs252075 


31232 


178726482 


C/G 


0.35 


0.37 


0.572 


rs252074 


31504 


178726754 


A/G 


0.35 


0.35 


0.914 


rs252068 


32583 


178727833 


C/G 


0.48 


0.48 


0.853 


rs252069 


32794 


178728044 




0.29 


0.28 


0.765 


rs 194040 


32840 


178728090 


C/T 


0.31 


0.33 


0.450 


rs252070 


33044 


178728294 


C/T 


0.57 


0.58 


0.609 


rs3797606 


33150 


1 78728400 


A/C 


0-87 


0J91 


— 0.119 — 


rs171667 


33218 


178728468 


A/G 


0.45 


0.50 


0.125 


rs 187539 


33513 


178728763 


Q£L 


0.33 


0.34 


0.709 


rsoooooo** 


33959 




" 

AACTAC 
CATGAA 








rs252071 


34486 


1 78729736 


A/G 


030 


b~32 


— 0 566 — 


rs252072 


36289 


178731539 


— cTf — 


0^48 

: 


— oiri — 

: 


— 0 400 — 

: 


rs252073 




178731820 


— c7T — 








rs379589 


38247 


1 78733497 


— aTt — 


059 


b~65 


— 0 035 — 


rs2052472 


38477 


178733727 


A/C 


0.04 


0.06 


0.493 


rs2052471 


38518 


178733768 


C/T 


0.87 


0.88 


0.697 


rs2052470 


38529 


178733779 


err 


0.84 


0.78 


0.036 


rs2052469 


38667 


178733917 


A/G 


0.84 


0.79 


0.086 


rs3797608 


39781 


178735031 


C/T 


0.06 


0.07 


0.530 


rs3797609 


39856 


178735106 


C/T 


0.04 


0.05 


0.841 


rs3822601 


39927 


178735177 


C/T 


0.08 


0.08 


0.904 


rs153131 


40506 


178735756 


A/G 


0.77 


0.77 


0.964 


rs751546 


41869 


178737119 


C/G 


0.94 


0.92 


0.265 


rs2279979 


42452 


178737702 


C/T 


0.94 


0.92 


0.238 


rs252060 


44788 


178740038 


C/T 


0.82 


0.80 


0.553 


rs3797610 


46059 


178741309 


A/C 


0.16 


0.18 


0.459 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs 194039 


46846 


178742096 


A/G 


0.43 


0.45 


0.589 


rs 168773 


47712 


178742962 


A/T 


0.34 


0.35 


0.845 


rs252061 


48796 


178744046 


C/T 


0.23 


0.22 


0.884 


rs 187537 


49441 


178744691 


C/G 








rs252062 


49602 


178744852 


A/T 


0.98 


0.96 


0.310 


rs2431255 


49723 


178744973 


A/C 


0.24 


0.19 


0.108 


rs3797612 


50050 


178745300 


C/T 


0.42 


0.46 


0.254 


rs3797613 


50171 


178745421 


C/T 


0.19 


0.21 


0.576 


rs614114 


50477 


178745727 


C/T 


0.52 


0.54 


0.717 


rs252063 


50818 


1 78746068 


C/T 


0.55 


0.57 


0.537 


rs252064 


50833 


178746083 


C/T 


0.52 


0.50 


0.609 


rs252065 


50881 


178746131 


A/G 


0.21 


0.25 


0.234 


rs450502 


50882 


178746132 


A/G 








rs439252 


51386 


178746636 


C/T 








rs252066 


51534 


178746784 


C/T 


0.20 


0.20 


0.883 


TS457957 


52317 


178747567 


A/G 


0.66 


0.71 


0.162 


rs3797614 


52368 


178747618 


C/T 








rs423552 


52970 


1 78748220 


A/G 


0.90 


0.92 


0.380 


rs398829 


53023 


178748273 


A/G 








rs4 16646 


53356 


178748606 


A/G 


0.58 


0.59 


0.915 


rs 187450 


53882 


178749132 


GfT 








rs337807 


54553 


178749803 


C/T 


0.60 


NA 


NA 


rs337806 


55475 


178750725 


A/C 


0.10 


0.10 


0.997 


rs1 396438 


55530 


178750780 


A/G 


0.52 


0.57 


0.188 


rs1 396437 


55691 


178750941 


C/T 








rs2411811 


55848 


178751098 


A/C 








rs2898813 


55879 


178751129 


C/G 








rs1 89256 


56316 


178751566 


A/G 


0.21 


0.20 


0.852 


rs1 73072 


56911 


178752161 


A/C 








rs337805 


57320 


178752570 


A/G 


0.24 


0.24 


0.950 


rs191415 


57391 


178752641 


C/T 








rs 180045 


57437 


1 78752687 


C/T 


0.47 


0.46 


0.918 


rs 189255 


57478 


178752728 


C/G 


0.14 


0.13 


0.764 


rs652766 


57500 


1 78752750 


C/T 


0.59 


0.61 


0.570 


rs466750 


59111 


178754361 


GfT 


0.38 


0.37 


0.606 


rs442406 


59333 


178754583 


A/G 


0.56 


0.57 


0.882 


rs662407 


59715 


178754965 


A/G 


0.32 


0.27 


0.134 


rs592971 


59804 


178755054 


A/G 








rs457187 


59851 


178755101 


A/G 


0.23 


0.25 


0.451 


rs459490 


59929 


178755179 


C/T 


0.22 


0.21 


0.671 


rs459668 


60052 


178755302 


C/T 


0.20 


0.19 


0.712 


rs462646 


60240 


178755490 


C/T 


0.42 


0.44 


0.439 


rs458272 


60359 


178755609 


G/T 


0.21 


0.21 


0.755 


rs463455 


60381 


178755631 


A/G 


0.25 


0.25 


0.783 


rs675880 


60456 


178755706 


C/T 


0.62 


0.63 


0.741 


rs810617 


60724 


1 78755974 


C/G 








rs464156 


60875 


178756125 


C/T 


0.32 


0.34 


0.541 


rs458083 


60968 


178756218 


A/G 


0.80 


0.82 


0.499 


rs467333 


60978 


178756228 


C/G 


0.10 


0.13 


0.243 


rs465381 


60998 


178756248 


C/T 








rs466363 


61557 


178756807 


C/T 


0.31 


0.34 


0.494 


rs2457099 


62091 


178757341 


C/T 


0.45 


0.45 


0.997 


rs463901 


62645 


178757895 


C/T 


0.46 


0.46 


0.852 


rs465621 


62943 


178758193 


A/C 


0.64 


0.63 


0.853 


rs463724 


63131 


178758381 


A/T 


0.09 


0.08 


0.737 


rs465242 


63145 


178758395 


G/T 








rs467419 


63406 


178758656 


A/G 


0.64 


0.65 


0.694 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs456135 


63427 


178758677 


C/G 


0.79 


0.76 


0.339 


rs464536 


63554 


178758804 


C/T 


0.36 


0.34 


0.553 


rs461898 


63661 


178758911 


A/G 


0.31 


0.33 


0.727 


rs389558 


64093 


178759343 


A/G 


0.27 


0.28 


0.762 


rs466752 


64153 


178759403 


err 


0.34 


0.38 


0.223 


rs455655 


64409 


178759659 


C/G 


0.87 


untyped 


NA 


rs463435 


64544 


1 78759794 


C/T 


0.65 


0.65 


0.973 


rs21 74971 


65257 


178760507 


err 


0.49 


0.51 


0.476 


rs1 979979 


65626 


178760876 


A/G 


0.08 


0.07 


0.579 


rs4 11804 


65739 


178760989 


A/G 


0.77 


0.79 


0.420 


rs 1623885 


66392 


178761642 


err 


0.81 


0.78 


0.451 


rs 1643811 


66720 


178761970 


C/T 


0.26 


0.25 


0.715 


rs434430 


69177 


178764427 


A/T 








rs1 87538 


69336 


178764586 


G/T 








rs252067 


69636 


178764886 


A/G 


0.22 


0.22 


0.978 


rs459319 


69823 


1 78765073 


A/G 


0.19 


0.22 


0.245 


rs467289 


69928 


178765178 


C/T 


0.26 


0.29 


0.377 


rs462644 


70547 


178765797 


C/T 


0.58 


0.56 


0.637 


rs458752 


70633 


178765883 


C/T 


0.18 


0.23 


0.129 


rs708320 


71805 


178767055 


A/C 








rs457954 


72181 


178767431 


C/G 


0.69 


0.73 


0.143 


rs2411810 


72200 


178767450 


C/T 


0.28 


0.23 


0.083 


rs3084687 


72474 


178767724 


-/AT 


0.12 


0.13 


0.767 


rs69638 


72567 


178767817 


C/G 


0.53 


0.49 


0.157 


rs455452 


72973 


178768223 


A/G 


0.58 


0.61 


0.313 


rs464850 


73468 


178768718 


A/G 


0.13 


0.10 


0.171 


rs431472 


73889 


178769139 


A/G 


0.32 


0.39 


0.048 


rs2411809 


75730 


178770980 


C/T 








rs2457094 


75970 


178771220 


A/G 


0.70 


0.75 


0.157 


rs2457095 


76114 


178771364 


A/G 


0.74 


0.75 


0.707 


rs2261740 


76342 


178771592 


C/T 


0.34 


untyped 


NA 


rs1 109180 


76449 


178771699 


A/G 








rs1 109179 


76465 


178771715 


C/T 








rs1 109178 


76791 


178772041 


A/C 


0.47 


0.48 


0.715 


rs456909 


78042 


178773292 


A/G 


0.56 


0.54 


0.537 


rs469124 


80758 


178776008 


A/G 








rs468039 


80778 


178776028 


C/T 








rs467017 


81356 


178776606 


A/C 


0.33 


0.31 


0.480 


rs469290 


81576 


178776826 


A/G 


0.63 


0.66 


0.427 


rs469090 


81689 


178776939 


C/T 


0.80 


0.83 


0.300 


rs469568 


81759 


178777009 


G/T 


0.39 


0.43 


0.234 


rs468386 


81950 


178777200 


C/G 








rs469349 


82562 


178777812 


A/C 








rs469099 


83591 


178778841 


C/T 


0.66 


0.60 


0.066 


rs4 56868 


83700 


178778950 


A/G 








rs465389 


83821 


178779071 


C/G 








rs463892 


83842 


178779092 


C/G 








rs468548 


83923 


178779173 


G/T 








rs654612 


83929 


178779179 


A/C 








rs4 68542 


84021 


178779271 


C/G 








rs469262 


84175 


178779425 


C/T 


0.46 


0.50 


0.232 


rs708323 


84417 


178779667 


A/G 


0.72 


0.66 


0.071 


rs469089 


84747 


178779997 


C/G 








rs469396 


85746 


178780996 


C/G 


0.37 


0.35 


0.522 


rs468723 


86129 


178781379 


C/T 


0.39 


0.41 


0.495 


rs467604 


86335 


178781585 


A/G 


0.33 


0.30 


0.303 


rs338874 


87315 


178782565 


C/G 


0.44 


0.46 


0.628 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 
Value 


rs338875 


87648 


178782898 


A/G 


0.49 


0.54 


0.106 


rs 1385803 


87764 


178783014 


A/C 








rs 1385804 


87770 


178783020 


C/G 








rs338876 


88221 


178783471 


err 


0.38 


0.36 


0.609 


rs1 89803 


90474 


178785724 


A/C 








rs452215 


91148 


178786398 


G/T 








rs641170 


91150 


178786400 


G/T 








rs584398 


91160 


178786410 


G/T 








rs385330 


^ 91733 


178786983 


C/T 








rs429538 


91772 


178787022 


A/C 








rs371229 


91785 


178787035 


C/T 








rs460874 


93140 


178788390 


A/T 


0.74 


0.69 


0.118 


rs646121 


93148 


178788398 


A/T 


0.93 


0.95 


0.477 


rs468262 


96080 


178791330 


A/G 








rs467863 


96157 


178791407 


C/G 








rs191434 


96313 


178791563 


A/C 








rs2054782 


96759 


178792009 


C/T 


0.45 


0.42 


0.514 


rs468499 


97026 


178792276 


A/C 








rs 180287 


97320 


1 78792570 


C/G 








rs338877 


97732 


178792982 


A/T 


0.04 


0.04 


0.781 


rs650665 


98713 


178793963 


C/G 








rs193419 


99707 


178794957 


A/C 








rs 180288 


99959 


178795209 


C/G 








rs1 86834 


100009 


178795259 


A/G 








rs1 89266 


100020 


178795270 


C/G 








rs 189267 


100065 


178795315 


A/C 








rs 170937 


100086 


178795336 


C/G 








rs463263 


101270 


178796520 


C/G 








rs463262 


101276 


178796526 


G/T 








rs460454 


101371 


178796621 


C/T 








rs460455 


101376 


178796626 


C/G 








rs460505 


101439 


178796689 


C/T 








rs931316 


101820 


178797070 


C/T 








rs463431 


102392 


178797642 


C/G 








rs461542 


102602 


178797852 


A/G 








rs463557 


102604 


178797854 


A/C 








rs191453 


102896 


178798146 


C/T 


0.15 


0.19 


0.139 


rs2271212 


189104 


178884354 


C/T 


0.64 


0.58 


0.072 


rs462009 


189134 


178884384 


C/T 








rs2271211 


189205 


178884455 


A/G 








rs396474 


Not mapped 


Not mapped 


A/C 








rs428901 


Not mapped 


Not mapped 


A/T 


0.66 


untyped 


NA 


rs452300 


Not mapped 


Not mapped 


G/T 








rs670256 


Not mapped 


Not mapped 


G/T 









TABLE 15 



dbSNP 
rs# 


Position in 
SEQ ID NO: 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs2278221 


210 


178695460 


C/T 


0.64 


0.64 


0.837 


rs1 650358 


3608 


178698858 


C/G 








rs 16438 18 


3609 


178698859 


C/G 








rs3733916 


4318 


178699568 


C/T 








rs1 624933 


5593 


178700843 


A/G 


0.73 


0.75 


0.447 


rs1 624857 


5629 


178700879 


C/T 


0.78 


0.81 


0.289 
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dbSNP 

rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs1 624832 


5639 


178700889 


A/G 


0.44 


0.47 


0.423 


rs 1624829 


5640 


178700890 


C/T 


0.90 


0.93 


0.294 


rs2161171 


8943 


178704193 


A/C 








rs1 530499 


17968 


178713218 


A/G 


0.39 


0.36 


0.499 


rs888764 


19887 


178715137 


A/G 








rs873987 


21034 


178716284 


A/G 








rs4078699 


21085 


178716335 


C/T 


0.57 


0.54 


0.316 
0.579 


rs870311 


21596 


178716846 


A/G 


0.52 


0.50 


rs1643817 


23379 


178718629 


A/C 








rs1643816 


23432 


178718682 


A/C 








rs1 650355 


24007 


178719257 


A/C 








rs888763 


26121 


178721371 


A/G 


0.40 


0.44 


0.264 


rs1862212 


26273 


178721523 


A/T 


0.56 


0.53 


0.529 


rs1 110514 


26755 


178722005 


A/T 


0.30 


0.27 


0.381 


rs3797600 


27411 


178722661 


C/T 


0.55 


0.54 


0.840 


rs3797602 


27710 


178722960 


G/T 


0.68 


0.65 


0.534 


rs3797603 


27842 


178723092 


C/T 








rs3776819 


28379 


178723629 


C/T 


0.45 


0.47 


0.662 


rs252076 


29603 


178724853 


C/T 


0.46 


0.46 


0.986 


rs252075 


31232 


178726482 


C/G 


0.36 


0.34 


0.666 


rs252074 


31504 


178726754 


A/G 


0.35 


0.33 


0.604 


rs252068 


32583 


178727833 


C/G 


0.47 


0.48 


0.648 


rs252069 


32794 


178728044 


A/G 


0.27 


0.26 


0.640 


rs 194040 


32840 


178728090 


C/T 


0.31 


0.30 


0.734 


rs252070 


33044 


178728294 


C/T 


0.61 


0.55 


0.157 


rs3797606 


33150 


178728400 


A/C 


0.91 


0.83 


0.005 


rs171667 


33218 


178728468 


A/G 


0.51 


0.52 


0.674 


rs1 87539 


33513 


178728763 


C/T 


0.32 


0.33 


0.836 


rs3836834 


33959 


178729209 


/TATCA 
AACTAC 
CATGAA 
A 








rs252071 


34486 


178729736 


A/G 


0.30 


0.30 


0.942 


rs252072 


36289 


178731539 


C/T 


0.50 


0.49 


0.684 


rs252073 


36570 


178731820 


C/T 








rs379589 


38247 


178733497 


A/T 


0.60 


0.61 


0.981 


rs2052472 


38477 


178733727 


A/C 


0.06 


0.06 


0.856 


rs2052471 


38518 


178733768 


C/T 


0.91 


0.86 


0.079 


rs2052470 


38529 


178733779 


C/T 


0.82 


0.83 


0.828 


rs2052469 


38667 


178733917 


A/G 


0.82 


0.82 


0.983 


rs3797608 


39781 


178735031 


C/T 


0.06 


0.06 


0.969 


rs3797609 


39856 


178735106 


C/T 


0.05 


0.05 


0.879 


rs3822601 


39927 


178735177 


C/T 


0.07 


0.08 


0.838 


rs153131 


40506 


178735756 


A/G 


0.76 


0.76 


0.981 


rs751546 


41869 


178737119 


C/G 


0.91 


0.92 


0.526 


rs2279979 


42452 


178737702 


C/T 


0.92 


0.92 


0.906 


rs252060 


44788 


178740038 


C/T 


0.81 


0.85 


0.157 


rs3797610 


46059 


178741309 


A/C 


0.18 


0.16 


0.593 


rs1 94039 


46846 


178742096 


A/G 


0.39 


0.49 


0.005 


rs 168773 


47712 


178742962 


A/T 


0.37 


0.43 


0.098 


rs252061 


48796 


178744046 


C/T 


0.19 


0.15 


0.164 


rs1 87537 


49441 


178744691 


C/G 








rs252062 


49602 


178744852 


A/T 


0.93 


0.95 


0.290 


rs2431255 


49723 


178744973 


A/C 


0.23 


0.19 


0.201 


rs3797612 


50050 


178745300 


C/T 


0.32 


0.38 


0.102 


rs3797613 


50171 


178745421 


C/T 


0.23 


NA 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs614114 


50477 


178745727 


C/T 


0.48 


0.51 


0.423 


rs252063 


50818 


178746068 


C/T 


0.60 


0.51 


0.011 


rs252064 


50833 


178746083 


C/T 


0.51 


0.56 


0.265 


rs252065 


50881 


178746131 


A/G 


0.22 


0.18 


0.175 


rs450502 


50882 


178746132 


A/G 








rs439252 


51386 


178746636 


C/T 








rs252066 


51534 


178746784 


C/T 


0.18 


0.16 


0.451 


rs457957 


52317 


1 78747567 


A/G 


0.67 


0.68 


0.728 


rs3797614 


52368 


178747618 


C/T 








rs423552 


52970 


178748220 


A/G 


0.89 


0.91 


0.398 


rs398829 


53023 


178748273 


A/G 








rs4 16646 


53356 


178748606 


A/G 


0.54 


0.55 


0.643 


rs 187450 


53882 


178749132 


G/T 








rs337807 


54553 


178749803 


C/T 


0.49 


0.59 


0.009 


rs337806 


55475 


178750725 


A/C 


0.11 


0.10 


0.889 


rs 1396438 


55530 


178750780 


A/G 


0.61 


0.50 


0.007 


rs1 396437 


55691 


178750941 


C/T 








rs2411811 


55848 


178751098 


A/C 








rs2898813 


55879 


178751129 


C/G 








rs 189256 


56316 


178751566 


A/G 


0.17 


0.17 


0.923 


rs 173072 


56911 


178752161 


A/C 








rs337805 


57320 


178752570 


A/G 


0.27 


0.25 


0.582 


rs191415 


57391 


178752641 


C/T 








rs 180045 


57437 


178752687 


C/T 


0.56 


0.48 


0.115 


rs 189255 


57478 


178752728 


C/G 


0.16 


0.12 


0.168 


rs652766 


57500 


178752750 


C/T 


0.55 


0.61 


0.231 


rs466750 


59111 


178754361 


G/T 


0.31 


0.28 


0.473 


rs442406 


59333 


178754583 


A/G 


0.58 


0.63 


0.209 


rs662407 


59715 


178754965 


A/G 


0.30 


0.28 


0.449 


rs592971 


59804 


178755054 


A/G 








rs457187 


59851 


178755101 


A/G 


0.23 


0.21 


0.402 


rs459490 


59929 


178755179 


C/T 


0.20 


0.19 


0.708 


rs459668 


60052 


178755302 


C/T 


0.21 


0.20 


0.821 


rs462646 


60240 


178755490 


C/T 


0.44 


0.41 


0.460 


rs458272 


60359 


178755609 


G/T 


0.22 


0.20 


0.524 


rs463455 


60381 


178755631 


A/G 


0.23 


0.22 


0.629 


rs675880 


60456 


178755706 


C/T 


0.65 


0.67 


0.564 


rs810617 


60724 


1 78755974 


C/G 








rs464156 


60875 


178756125 


C/T 


0.37 


0.34 


0.439 


rs458083 


60968 


178756218 


A/G 








rs467333 


60978 


178756228 


C/G 


0.11 


0.11 


0.902 


rs465381 


60998 


178756248 


C/T 








rs466363 


61557 


178756807 


C/T 


0.32 


0.34 


0.547 


rs2457099 


62091 


178757341 


C/T 


0.43 


0.43 


0.974 


rs463901 


62645 


178757895 


C/T 


0.39 


0.43 


0.342 


rs465621 


62943 


178758193 


A/C 


0.59 


0.64 


0.195 


rs463724 


63131 


178758381 


A/T 


0.09 


0.07 


0.539 


rs465242 


63145 


178758395 


G/T 








rs467419 


63406 


178758656 


A/G 


0.66 


0.67 


0.752 


rs456135 


63427 


178758677 


C/G 


0.79 


0.85 


0.029 


rs464536 


63554 


178758804 


C/T 


0.36 


0.32 


0.332 


rs461898 


63661 


178758911 


A/G 


0.28 


0.31 


0.423 


rs389558 


64093 


178759343 


A/G 


0.20 


0.23 


0.311 


rs466752 


64153 


178759403 


C/T 


0.36 


0.35 


0.781 


rs455655 


64409 


178759659 


C/G 


NA 


0.72 


NA 


rs463435 


64544 


178759794 


C/T 


0.72 


0.68 


0.230 


rs21 74971 


65257 


178760507 


C/T 


0.56 


0.51 


0.142 
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dbSNP 

rs# 


Position in 
SEQ ID NO: 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs 1979979 


65626 


178760876 


A/G 


005 


005 


— 0 993 — 


rs41 1804 


65739 


178760989 


A/G 


0^80 


0T7 


— 0 343 — 


rs 1623885 


66392 


178761642 







084 


— 0 819 — 


rs 164381 1 




178761970 


— cTT — 


022 

'■ 


0^23 

: 


— 0 847 — 

'- 


rs434430 


KQ177 


178764427 


— aTt — 








rs1 87538 


69336 


1 78764586 


rff 








rs252067 


69636 


1 78764886 


~Air 

a/P 


cTo^ 


rToI 


— — — — - — 
— n 0^0 — 


rs459319 


69823 


1 78765073 




niR 


015 




rs467289 


69928 


1 787651 78 


rTf 


K77 


Ci~77 


— n i7Q — 


rS^fDZDn^ 




\ 1 01 00 r y / 


F7f 


rTfin 


tTki 


— n Rr>Q — 


rs4 58752 


vncoo 


1 78765883 


rn 


ni« 

'- 


015 

'■ 


— n 971 — 




rs708320 


7iftnK 


1 78767055 


A~7r 








rs457954 


79ifti 


1 78767431 


F7F 


rv79 


rT79 


— rTRR9 — 
— nfifn — 


rs241 1810 


7oonn 


1 78767450 


rrr 


rTpQ 


rrv) 




rs3084687 


72474 


178767724 


Tat 


CM3 


oTi 





rs69638 





1 7876781 7 


'tit 


rT^4 


rT^7 


— ri aaci — 


rs455452 


79Q7-J 


1 78768223 


ait 


OR?) 


0^58 


— 0 499 — 


rs464850 


7-3Afi« 


178768718 


ait 


— (TTo — 


cTOQ 


— n «qo — 


rs431472 


70QOQ " 


17A7RQ-I9.0. 

\ / 0 f oy i oy 


"aJt 


pr^F 

: 


Koi 


— 0 025 — 

! 


rs241 1809 


75730 


1 78770980 


— err — 




— 




rs2457094 


7597Q 


178771220 


A/G 


OTl 


070 


0792 — 


rs2457095 


76114 


178771364 


ait 


rp7^ 


rT7R 


— n fin? — 


rs2261740 


76342 — " 


1 78771 592 


— erf — 


0~36 

: 


6^36 

'■ 


4 — 

: 


rs1 109180 


76449 


1 78771 699 


A/G 








rs1 1091 79 


' 7fi4fiS 


178771715 


rvf 








rs1 1091 78 


7R7Q1 


1 78772041 


aTt 


rTZ^ 

Tl^f 


tTZt 


— cTZ?c\ — 


rs^ooyuy 


70049 


] / Of I JZyZ 


a77^ 


: 


n^i 

051 


— n cqr — 
— 0.598 — 


rs469124 


anvco 


1 78776008 


Air 








rs468039 


Rn77P 


1 78776028 


rvf 








rs46701 7 


aiufi 


1 7W77RROR 
I f Of f DDUD 


Air 


rv5i" 


rv^ 


— (Ttm — 


rs^oy^yu 


QHC7R 


1 7R77£P.9f; 


aTF 


rTIu^ 

nnl 


rTZZ 


— n 990 — 


rs469090 


QHRQQ 


1 78776939 


rrr 

T^Tf 


rT^S 


ntvi 

<T^k 


noii 


rs469568 


Q 7 O 

olqcn 


1 78777009 




036 


O30 


n c 

— 0.115 — 


rc/lKfilRK 

rs^fDoooD 




-1 7R7779nn 

\ 1 0 1 f l ^uu 


rir 








rs^oyo^y 


QOCRO 


1 7fi777fl1 O 











rs4oyuyy 


83591 _ 


17fl77flR l H 


7=7f 


PTr^ 

0J55 


— -~ 

037 


— 0-550 — 


rs456868 


QOQO? 


1 78778950 


iF 










rs465389 




1 78779071 










rs^uooy^ 


00049 


1 7P.770.nQ9 

wo/ f yuy^ 




7^7t 








rs468548 


33923 


1 787791 73 










rs65461 2 


83929 


178779179 


aTF 








rs468542 


|4ril 


178779271 


7vr 








rs469262 




178779425 


tTt 


045 


043 


— 0~762 


rs708323 


a^ii7 


1 78779667 


a/t 


cTtZ. 




cTTa 

: 


— n «qq — 
: 


rs469089 


04747 


1 78779997 


rlr^ 








rs469396 


0C74C 


1 78780996 


rTF 


rv^Q 


tTJt 


fT^RQ 


rs468723 


RPi19Q 


1 78781 379 


rrf 


r^kR 


rP^i 


— n c 7 o — 


rs467604 


86335 


178781585 


A^G 


0^35 


036" 


— (T763 — 


rs338874 


87315 


178782565 


C/G 


0.42 


0.40 


0.564 


rs338875 


87648 


178782898 


A/G 


0.46 


0.45 


0.701 


rs1 385803 


87764 


178783014 


A/C 








rs1 385804 


87770 


178783020 


C/G 








rs338876 


88221 


178783471 


C/T 


0.41 


0.44 


0.580 


rs1 89803 


90474 


178785724 


A/C 








rs452215 


91148 


178786398 


G/T 








rs641170 


91150 


178786400 


G/T 








rs584398 


91160 


178786410 


G/T 
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dbSNP 

rs# 


SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs385330 


91733 


178786983 


C/T 








rs429538 


91772 


178787022 


A/C 








rs371229 


91785 


178787035 


C/T 








rs460874 


93140 


178788390 


A/T 


0.73 


0.75 


0.550 


rs646121 


93148 


178788398 


A/T 


0.93 


0.92 


0.697 


rs468262 


96080 


178791330 


A/G 








rs467863 


96157 


178791407 


C/G 








rs191434 


96313 


178791563 


A/C 








rs2054782 


96759 


178792009 


C/T 


0.43 


0.40 


0.473 


rs468499 


97026 


178792276 


A/C 








rs1 80287 


97320 


178792570 


C/G 








rs338877 


97732 


1 78792982 


A/T 


0.04 


0.04 


0.928 


rs650665 


98713 


178793963 


C/G 








rs193419 


99707 


178794957 


A/C 








rs1 80288 


99959 


178795209 


C/G 








rs1 86834 


100009 


1 78795259 


A/G 








rs1 89266 


100020 


178795270 


C/G 








rs 189267 


100065 


178795315 


A/C 








rs1 70937 


100086 


178795336 


C/G 








rs463263 


101270 


178796520 


C/G 








rs463262 


101276 


178796526 


G/T 








rs460454 


101371 


178796621 


C/T 








rs460455 


101376 


178796626 


C/G 








rs460505 


101439 


178796689 


C/T 








rs931316 


101820 


178797070 


C/T 








rs463431 


102392 


178797642 


C/G 








rs461542 


102602 


178797852 


A/G 








rs463557 


102604 


178797854 


A/C 








rs191453 


102896 


178798146 


C/T 


0.06 


0.06 


0.929 


rs2271212 


189104 


178884354 


C/T 


0.66 


0.56 


0.012 


rs462009 


189134 


178884384 


C/T 








rs2271211 


189205 


178884455 


A/G 








rs396474 


Not mapped 


Not mapped 


A/C 








rs428901 


Not mapped 


Not mapped 


A/T 


0.61 


0.72 


0.002 


rs452300 


Not mapped 


Not mapped 


G/T 








rs670256 


Not mapped 


Not mapped 


G/T 









[0239] Allelotyping results were considered particularly significant with a calculated p-value of less 
than or equal to 0.05 for allelotype results. These values are indicated in bold. The allelotyping p-values 
were plotted in Figure 1 for the discovery cohort. The position of each SNP on the chromosome is 
presented on the x-axis. The y-axis gives the negative logarithm (base 10) of the p-value comparing the 
estimated allele in the case group to that of the control group. The minor allele frequency of the control 
group for each SNP designated by an X or other symbol on the graphs in Figure 1 can be determined by 
consulting Table 13. For example, the left-most X on the left graph is at position 178695460. By 
proceeding down the Table from top to bottom and across the graphs from left to right the allele 
frequency associated with each symbol shown can be determined. 

[0240] To aid the interpretation, multiple lines have been added to the graph. The broken horizontal 
lines are drawn at two common significance levels, 0.05 and 0.01. The vertical broken lines are drawn 
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every 20kb to assist in the interpretation of distances between SNPs. Two other lines are drawn to 
expose linear trends in the association of SNPs to the disease. The light gray line (or generally bottom- 
most curve) is a nonlinear smoother through the data points on the graph using a local polynomial 
regression method (W.S. Cleveland, E. Grosse and W.M. Shyu (1992) Local regression models. Chapter 
8 of Statistical Models in S eds J.M. Chambers and T.J. Hastie, Wadsworth & Brooks/Cole.). The black 
line provides a local test for excess statistical significance to identify regions of association. This was 
created by use of a lOkb sliding window with lkb step sizes. Within each window, a chi-square 
goodness of fit test was applied to compare the proportion of SNPs that were significant at a test wise 
level of 0.01, to the proportion that would be expected by chance alone (0.05 for the methods used here). 
Resulting p-values that were less than 10 s were truncated at that value. 

[0241] Finally, the exons and introns of the genes in the covered region are plotted below each 
graph at the appropriate chromosomal positions. The gene boundary is indicated by the broken 
horizontal line. The exon positions are shown as thick, unbroken bars. An arrow is place at the 3' end of 
each gene to show the direction of transcription. 

Example 5 
Effect of ADAMTS2 Polypeptides on 
Biosynthesis of Type II Collagen in Patients with OA 

[0242] To investigate the effect of ADAMTS2 polypeptides on Type II collagen biosynthesis and 
processing, human articular cartilage from OA patients undergoing joint replacement is harvested, 
dissected and maintained as described by Nelson et al. (1998) supra. Type II procollagen levels in 
osteoarthritic patients and autopsy controls is determined by radioimmunoassay (RIA) as previously 
described. Allelic variations {e.g., rs398829) are determined for the OA patients and controls by 
genotyping (See Examples 1 and 2). As type II procollagen is processed by ADAMTS2, increased levels 
of Type II procollagen in individuals with the allelic variation associated with OA demonstrates that this 
variation leads to reduced procollagen processing activity and ultimately to OA. 

Example 6 

Effect of ADAMTS2 Polypeptides on Type II Collagen Processing Activity 

[0243] To investigate the effect of ADAMTS2 polypeptide variants on ADAMTS2 collagen 
processing activity, recombinant polypeptides encompassing the ADAMTS2 variation of SEQ ID NO: 2 
at position 733 and a wild-type ADAMTS2 polypeptide are expressed in cell lines such as chondrocytes. 
Since the allelic variation of ADAMTS2 at position 733 of SEQ ID: NO: 2 will prevent the conversion of 
the ADAMTS2 pro-enzyme to the catalytically active enzyme, processing of ADAMTS2 pro-enzyme is 
monitored by SDS-PAGE analysis followed by Western Blotting using antibodies to ADAMTS2 and 
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methods common to someone skilled in the art. Reduced levels of pro-enzyme cleavage are apparent by 
the increased levels of immunopositive protein of higher molecular weight than of the cleaved active 
protein. 

Example 7 

In Vitro Production of Target Polypeptides 

[0244] cDNA is cloned into a pIVEX 2.3-MCS vector (Roche Biochem) using a directional cloning 
method. A cDNA insert is prepared using PCR with forward and reverse primers having 5' restriction 
site tags (in frame) and 5-6 additional nucleotides in addition to 3' gene-specific portions, the latter of 
which is typically about twenty to about twenty-five base pairs in length. A Sal I restriction site is 
introduced by the forward primer and a Sma I restriction site is introduced by the reverse primer. The 
ends of PCR products are cut with the corresponding restriction enzymes {i.e., Sal I and Sma I) and the 
products are gel-purified. The pIVEX 2.3-MCS vector is linearized using the same restriction enzymes, 
and the fragment with the correct sized fragment is isolated by gel-purification. Purified PCR product is 
ligated into the linearized pIVEX 2.3-MCS vector and E. coli cells transformed for plasmid 
amplification. The newly constructed expression vector is verified by restriction mapping and used for 
protein production. 

[0245] E. coli lysate is reconstituted with 0.25 ml of Reconstitution Buffer, the Reaction Mix is 
reconstituted with 0.8 ml of Reconstitution Buffer; the Feeding Mix is reconstituted with 10.5 ml of 
Reconstitution Buffer; and the Energy Mix is reconstituted with 0.6 ml of Reconstitution Buffer. 0.5 ml 
of the Energy Mix was added to the Feeding Mix to obtain the Feeding Solution. 0.75 ml of Reaction 
Mix, 50 n\ of Energy Mix, and 10 ug of the template DNA is added to the E. coli lysate. 

[0246] Using the reaction device (Roche Biochem), 1 ml of the Reaction Solution is loaded into the 
reaction compartment. The reaction device is turned upside-down and 10 ml of the Feeding Solution is 
loaded into the feeding compartment. All lids are closed and the reaction device is loaded into the 
RTS500 instrument. The instrument is run at 30°C for 24 hours with a stir bar speed of 150 rpm. The 
prVEX 2.3 MCS vector includes a nucleotide sequence that encodes six consecutive histidine amino 
acids on the C-terminal end of the target polypeptide for the purpose of protein purification. Target 
polypeptide is purified by contacting the contents of reaction device with resin modified with Ni 2+ ions. 
Target polypeptide is eluted from the resin with a solution containing free Ni 2+ ions. 
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Example 8 

Cellular Production of Target Polypeptides 

[0247] Nucleic acids are cloned into DNA plasmids having phage recombination cites and target 
polypeptides are expressed therefrom in a variety of host cells. Alpha phage genomic DNA contains 
short sequences known as attP sites, and E. coli genomic DNA contains unique, short sequences known 
as attB sites. These regions share homology, allowing for integration of phage DNA into E. coli via 
directional, site-specific recombination using the phage protein Int and the E. coli protein EHF. 
Integration produces two new att sites, L and R, which flank the inserted prophage DNA. Phage excision 
from E. coli genomic DNA can also be accomplished using these two proteins with the addition of a 
second phage protein, Xis. DNA vectors have been produced where the integration/excision process is 
modified to allow for the directional integration or excision of a target DNA fragment into a backbone 
vector in a rapid in vitro reaction (Gateway™ Technology (Invirrogen, Inc.)). 

[0248] A first step is to transfer the nucleic acid insert into a shuttle vector that contains attL sites 
surrounding the negative selection gene, ccdB (e.g. pENTER vector, Invirrogen, Inc.). This transfer 
process is accomplished by digesting the nucleic acid from a DNA vector used for sequencing, and to 
ligate it into the multicloning site of the shuttle vector, which will place it between the two attL sites 
while removing the negative selection gene ccdB. A second method is to amplify the nucleic acid by the 
polymerase chain reaction (PCR) with primers containing attB sites. The amplified fragment then is 
integrated into the shuttle vector using Int and IHF. A third method is to utilize a topoisomerase- 
mediated process, in which the nucleic acid is amplified via PCR using gene-specific primers with the 5' 
upstream primer containing an additional CACC sequence (e.g., TOPO® expression kit (Invirrogen, 
Inc.)). In conjunction with Topoisomerase I, the PCR amplified fragment can be cloned into the shuttle 
vector via the attL sites in the correct orientation. 

[0249] Once the nucleic acid is transferred into the shuttle vector, it can be cloned into an expression 
vector having attR sites. Several vectors containing attR sites for expression of target polypeptide as a 
native polypeptide, N-fusion polypeptide, and C-fusion polypeptides are commercially available (e.g. , 
pDEST (Invitrogen, Inc.)), and any vector can be converted into an expression vector for receiving a 
nucleic acid from the shuttle vector by introducing an insert having an attR site flanked by an antibiotic 
resistant gene for selection using the standard methods described above. Transfer of the nucleic acid 
from the shuttle vector is accomplished by directional recombination using Int, IHF, and Xis (LR 
clonase). Then the desired sequence can be transferred to an expression vector by carrying out a 
one hour incubation at room temperature with Int, IHF, and Xis, a ten minute incubation at 37°C with 
proteinase K, transforming bacteria and allowing expression for one hour, and then plating on selective 
media. Generally, 90% cloning efficiency is achieved by this method. Examples of expression vectors 
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are pDEST 14 bacterial expression vector with att7 promoter, pDEST 15 bacterial expression vector with 
a T7 promoter and a N-terminal GST tag, pDEST 17 bacterial vector with a T7 promoter and a N- 
terminal polyhistidine affinity tag, and pDEST 12.2 mammalian expression vector with a CMV promoter 
and neo resistance gene. These expression vectors or others like them are transformed or transfected into 
cells for expression of the target polypeptide or polypeptide variants. These expression vectors are often 
transfected, for example, into murine-transformed a adipocyte cell line 3T3-L1, (ATCC), human 
embryonic kidney cell line 293, and rat cardiomyocyte cell line H9C2. 

[0250] Modifications may be made to the foregoing without departing from the basic aspects of the 
invention. Although the invention has been described in substantial detail with reference to one or more 
specific embodiments, those of skill in the art will recognize that changes may be made to the 
embodiments specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the claims which follow. All publications or 
patent documents cited in this specification are incorporated herein by reference as if each such 
publication or document was specifically and individually indicated to be incorporated herein by 
reference. 

[0251] Citation of the above publications or documents is not intended as an admission that any of 
the foregoing is pertinent prior art, nor does it constitute any admission as to the contents or date of these 
publications or documents. U.S. patents and other publications referenced herein are hereby incorporated 
by reference. 

Nucleotide and Amino Acid Sequence Embodiments 

[0252] Table A includes information pertaining to the incident polymorphic variant associated with 
osteoarthritis identified herein. Public information pertaining to the polymorphism and the genomic 
sequence that includes the polymorphism are indicated. The genomic sequences identified in Table A 
may be accessed at the http address www.ncbi.nih.gov/entrez/query.fcgi, for example, by using the 
publicly available SNP reference number (e.g., rs398829). The chromosome position refers to the 
position of the SNP within NCBFs Genome Build 34, which may be accessed at the following http 
address: www.ncbi.nlm.nih.gov/mapview/map_search.cgi?chr=hum_chr.inf&query=. The "Contig 
Position" provided in Table A corresponds to a nucleotide position set forth in the contig sequence (see 
"Contig Accession No."), and designates the polymorphic site corresponding to the SNP reference 
number. The sequence containing the polymorphisms also may be referenced by the "Nucleotide 
Accession No." set forth in Table A. The "Sequence Identification" corresponds to cDNA sequence that 
encodes associated target polypeptides {e.g., ADAMTS2). The position of the SNP within the cDNA 
sequence is provided in the "Sequence Position" column of Table A. If the SNP falls within an exon, the 
corresponding amino acid position (and amino acid change, if applicable) is provided as well. Also, the 
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allelic variation at the polymorphic site and the allelic variant identified as associated with osteoarthritis 
is specified in Table A. All nucleotide and polypeptide sequences referenced and accessed by the 
parameters set forth in Table A are incorporated herein by reference. 



Table A 



RS ID 


Chrom- 


Chrom 
Position 


Contig 
Accession No. 
HI 


Contig 
Position 


Nucleotide 
Accession 
No. [2] 


Sequence 
Position 


Amino 
Acid 
Position 


Locus 


ID 


A [3] 


Allelic 
Vari- 
ability 


OA 
Assoc. 
Allele 


rs398829 


5 


178748273 


Hs5_77500_34:3 


1729878 


NM_014244 


coding- 
nonsynon 


V245I 


ADAMTS2 


9509 


R 


IG/A1 


G 



[1] Contig Accession Number which can be found in the NCBI Database: 
http address: www.ncbi.nih.gov/entrez/query.fcgi 

[2] Sequence Identification or Nucleotide Accession Number which can be found in the NCBI 
Database: 

http address: www.ncbi.nih.gov/entrez/query.fcgi 

[3] "A" column is the sequence orientation ("F" is forward, "R" is reverse). 

[0253] The following is a genomic nucleotide sequence for an ADAMTS2 region. The following 
nucleotide representations are used throughout: "A" or "a" is adenosine, adenine, or adenylic acid; "C" or 
"c" is cytidine, cytosine, or cytidylic acid; "G" or "g" is guanosine, guanine, or guanylic acid; "T" or "t" is 
thymidine, thymine, or thymidylic acid; and "I" or "i" is inosine, hypoxanthine, or inosinic acid. Exons 
are indicated in italicized lower case type, introns are depicted in normal text lower case type, and 
polymorphic sites are depicted in bold upper case type. SNPs are designated by the following 
convention: "R" represents A or G, "M" represents A or C; " W" represents A or T; "Y" represents C or 
T; "S" represents C or G; "K" represents G or T; "V" represents A, C or G; "H" represents A, C, or T; 
"D" represents A, G, or T; "B" represents C, G, or T; and "N" represents A, G, C, or T. 

ADAMTS2 Genomic Sequence (SEP ID NO. 1) 

>5 : 1786 95251-178 884 700 

1 gggcctcggg ccagcactgc ccagcgctgg gaagacagga gaccacaccc caagtggctt 

61 tgacacaggg cgtgtccctc ttaaggcaca gaggagaagt gggcagccag ggctggggat 

121 cctagggtgg cccctctgtg ccccaccctt ccccaggcca cttacacgtg gccagtctca 

181 tgggccacca caaacgctga ggagaagccR ccctcatggt tcagggtgca gctgcggacc 

241 ggatggcaca tgccggtgac aggagcatag cctgggagga gacaagaggc ggctccagat 

3 01 gctgccatag cctggccggg aaggtgaggc ctggcctaac tcccaggcgc tgcttctcct 
361 caaggcccca atgccctctc taccacaggc aactgccccc ggctggcaca gctcagaaga 

4 21 caccacagca gacagctcat agcctgtgac atctggctga cagcaggccc ccagcccgtc 
481 accacacaag ccccgtggcc gttctgcctg taacagccac actgcagctg gggccctctg 
541 cttagtcaga tgtcacctgc tatggcttgt ctgacacctg actttcctgc ctgccgttct 
601 ctttcacgcc acctgcctct gccccctgcc tgcctcactg agtggcccct gagcagcacc 
661 tgcttccaca cctctctctg ctagaggcat cctgcccccc gtctgcctgg tatcacaccc 
721 tccaagccca cctcctctgg ggggcgtttc ttgggtctcc ccgtgtttct ctggggtcct 
781 tgagacccta gctagacccg tttccataat gccgcagaag gctgcaaagc tgttcagccg 
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841 atgggccaca ctaccctgca gaatatccca gcagggggct ggctggtgcg gactgggagg 

901 tcagagccgc cgtagaagat gaggccagtg gcctacgaca gcccagggcc agggggaggg 

961 tggcagaagt tcatgacgca gagaccccac gtcaaactga ggaagatgga taatgagacc 

1021 tattcacttc cccagctcct gtctcagcag agggaggcgg cttggaaaga tagggaagtg 

1081 cctacagggc tatgggtgcc acacgaggtc tctgcttgta cccctgggag tcttcctaga 

1141 ggcctgccaa gcatggagcc ctgactttca gaaccccaac ctgccctcac tcctgccatg 

1201 ccccactgag agaccctgag gccacgcgtt ccacagagaa gggaaggagg gccactggga 

1261 ccccgggtga gctggcggtg tacgcgccta aaaatagaag catcgtgatc taacccttgt 

1321 cttttcccgt tgtaataaac aggaaaatct gcattttttg ctctgcagat tttgagacga 

1381 gcgtggagtc tgtgtgcctg agtggaacgt tgccatctcc tctttgccct ttgtatgcca 

1441 ggcttgcaga tgcatctgcc tccgtgggag gctttccatt catcctactc agcattacag 

1501 gagcctgcga gatctgaaag cgcatcttta tgtttttctt tgttttcttc cctcttattt 

1561 tactttgtgg ctgttttctc Cctccggaat cttattagat agagtgtgcc tcctcgatgg 

1621 agggtctgtc tcctaacttt tctctcatgt taatcttttt gtcttttgct cccattggga 

1681 gatggccttg actttttatt tcagctttcg atttctttct tttttttttg agatggagtc 

1741 tcactctgcc gtccaggctg gagtgcagtg gtgtgatctc ggctcactgc aacctccgcc 

1801 tcctgggttc agtccattct actgcctcag cctcctgagc agctgggact acaggcaccc 

1861 gccaccatgc tcagctaatt tttgtatttt tactagagac gggatttcac catgttagcc 

1921 aggatggtct ccatctcctg acctcgtgac ctgcccgcct cggcctccca cagtgctggg 

1981 attacaggcg tgagccaccg cgccccgtct acctttcgaa ttttttactt cccccatcct 

2041 atccataatt ttatacaaat ggtttgttgt tctctaattt ttcatagaaa actcttaggg 

2101 tttatggatg tattttctca aacattctgg gaatactact tttaaaaaca tgcttttctt 

2161 gtctctaaat Catctctatt ttctcttaga gccagttttc aaactttttg ttttgtttta 

2221 catcttggtc catctttttc atgctgtaag gatttcttgt gttttggttt ttttaaaata 

2281 tctggtaacc cttggctgtc ctttcatttt ttcttttttt tttttttttt tttgagacag 

2341 agtcttgctc tgttgcccag gctggagtgc agtggcacaa tctcggctca ctgcaacctc 

2401 tgcctccggg gttctagtga ttttcctgcc tcagcctcct gagtagctgg gattacaggc 

2461 acctgccact acactggcta atttttgtgt ttttagtaga gatgttggcc aggctggtca 

2521 cgaactcctg acctcaagtg atccgcccgc cttggcctcc caaagtgctg ggattacagg 

2581 agtgagccac cacacccagc caatttcaca tttttaaatg agggatcagg tcacttaata 

2641 taaaagctgt gtcggtttcc cctcctaggg gatagctctg ctccccatta gtgccctccc 

2701 caagacaaaa cagccaacag cactctcatc acatggtcag ggcaacacct ggctgcgctg 

2761 tgtttgaggg gagctatcca ctcctcaccc agacatggta gtttcagtct ttttaaccta 

2821 gctatgctat taggtgcgtc ttgatatttc attgtgtttt taaattttcc tttttttttt 

2881 ttcttgtaag acaaggcctc gctctgtcac ccaggctgga gtgcagtggt gtgatcatag 

2941 ctcacttcct gggctcaaac aatcctccca cctcaggttc ccgagtggct gggactacag 

3001 gcatccacca ccatgcccag cctcattgtt ttaattgcat ttttccaatg tctagtgagg 

3061 ttgagtgcct tttcatatgt ttattgctca ttcatatttt gtcaagagcc tgataagagt 

3121 ctcctgccca ttatctgact ggattgtctt tattatcctt attatgattt gtaggcgttc 

3181 ttcacatact tcagatataa atcctttttt ataaatattt cccccagtgg attagcgtgc 

3241 ctttaatgca caaacttttg aaatgtgcta ctatttatca gttttttcca ttgagttcat 

3301 ggtttgtttg gttttttttg tctctttttt ttttgagaca gagtctcgca ctgtctccca 

3361 ggctggagca aagtggtgcg atctcgactc attgcaaact ccgcctcccg ggttcacacc 

3421 attctcctgc ctcagcctcc cgggtagctg ggactacagg cgcccgccac catgccctgc 

3481 taattttttt gtatttttat tagagacggg gtttcaccat gttagccaag atggtctcga 

3541 tctcctgacc tcgtgatcta cctgcctccg actcccaaag tgctgggatt acaggtgtga 

3601 accaccgSSc ccggcctttt gtctcttttt aaagaaaaca tctcttactc caagaacatg 

3661 gagccatcca cctacgttat cttctttcat tattttgcgt ttcttattta gacctaaaat 

3721 ccatgtgaga tgctctctgt ctatagagtg agtggaggtg acaggcctcc tttctttcca 

3781 aattggcccc acaccatttg ttaggctgtt ttcctcctct tctctgaaat gtcacttttg 

3841 tgacacatta gcgtccctat acacataggt ccaattcagg acactggatt ctcttctaat 

3901 gatgtatttg tccatcagtg cactgcccca caccctcacc cagctaaggt ggccacggcc 

3961 ttccctgccc tgacactctt ccctgggctg ggccaaggct cccggggccc cttgcatggc 

4021 caggggctca tttccagctt cacgcctctg tggccctgca tgggtgaggc tcaccttgca 

40 81 tgccggaagg cccaaagtcc tgccgtgtga ggaagatggc gtgatcgtgg tattcatcgt 

4141 ggcccgtgtc tggcttctgc tggaggtagg cccagcggca gacattctcc aggctctgag 

4201 aggggttccc gatctcgatg aggctcatgg actgcagggg gatggagaga aatggaagag 

4261 agaggttggc gcctggtggc ccagtggggg gccgagcagg ctgtagtgtt gacagacYcc 

4321 atccactggg aatctgttcc aggtggtaca aaggccctgc ccggtcactc tcaggacctg 

4381 ctcccactgg gctccagtct ttgacaggga agaaaagttt ccccagtcac actcacctgc 

4441 gtgagtttcc acacaaagtc accgcagttt tcaggtgaga tgcagaagcc aagctctacg 

4501 tcatctaagg cctcgcatag tcccaccgtc taccccattt ccagaagtcc ctgtgtttgt 

4561 tctatgaggc acctgttccg tgcaaggcct ccacagaagg caaatgtcat gtggacctca 

4621 ccccaaagag ctcacagttt tgggagggtc cgtggcaggg gaggcggaaa ttccagtgtg 
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4681 gtgtgcaagg ccagaggaag cagatcacct tatacgttcc ccccgtgtga aggtacttgc 

4741 agaaaccaac acaatggagg aaaagaaaca gcaactgaca tatttggtgg gtgcaaaagt 

4801 aatcgcagta ttgccatgaa aagtaatggc aaagactttg gcatcaacct aatagaaaaa 

4861 cagccatatc aagaacaagg gcatacacag atgattccca caggaactat gcatgggaag 

4921 cccccgggaa ccatcccatt ccattttgta tcatgacgca cacctgacag ctagggtggc 

4981 ctcgggaccg ctcgtcttgc acactctctc tgcaccatga catcagcagg cactcccatt 

5041 tagacttcag gactaagaag cctgtcttgg acaggaaact gagactcaga gaagttaaga 

5101 agcttgccca ggtcatgaaa ctaggaagta ggagaattga gatcggatcc caagaaggct 

5161 gaaaccaaaa tctagactca actcactgcc ctaaacttcc tccttaaata agacatacac 

5221 aatatcattt caaaagcaag gcttctgtga gcggcaccct tccttttttc ctccacaatc 

5281 ctcacagcag gcattttggt ttctgttctc accatagact ccaaacaccg aagtagttcc 

5341 ctccaagaga accgagagcc cctctgagtc ctgggcatgt tcttttcccc tttaaggatt 

5401 tatctggcca taaaatagag atgtgcaccc cttggaagct acgtatgcag gtgttttttc 

5461 tcttgcataa atgtatccac aaagtagagc cacgtatttc tctttgtgtc tataaataac 

5521 aaaaagttgg ccgtgcgcgg tggctcacac ctgtaatccc agcactttgg gaggctgagg 

5581 caggtggatc acYtgaggtc aggagtttga gaccagcctg gccaacatRg tgaaacccYR 

5641 tctctactaa aaatgcaaaa cttagtctgg cttggtggcg taatcccagc aacctgggag 

5701 gctgaggaag gagaatcgct tgaacccggg aggtggaggt tgcagtgagc tgagatcacg 

5761 ccactgcact ccagcctagg cgacagagtg agactccgtc tcaaaacaaa acaaaacaaa 

5821 caacaaaggg tattgttagc aagccagtta gttccagaag aagggagcac aattcaaggc 

5881 taggtaagac aagtcgttcc caggggtccc agtcaatgac tgacaagatg gcagaccaat 

5941 tcccaagtcc caagcacaga gttaattgta aagactttat ttcctgctcc tgggagataa 

6001 gataactcat tctaacaagg tttccatgag tcccgatgaa ataagtcata aatcagagtg 

6061 tgatgtcctg tgactaagac ttgtgggaca ttggcataga agtggacaga ccagaggaag 

6121 ctggaggagg gcaaggggag gtgcacttcc tgttcccatc atgaccagga agagcatgtg 

6181 actagctctg gccaatgagc tgtgagcaaa agcggggcat gtctcttttc ggttggagca 

6241 tctaatttcc cattgagacc cttcagagag ctctcctttc tggcacgaga tctggcaaca 

6301 taaagacagt ggctgctctg tcacctagga cctgagtgac gatgagaccc gaatcctccg 

6361 ccaacccaca ttgaacatac agcgtgagca ggaaatcttg gtcgttttaa gccactggga 

6421 tgcaaagtgt gcttgttcct gtagcataac ctcacctact ataattgata caggaggaag 

6481 ggagtgggta tctgcatcac atccaagcct gacccctgca gctcatctcg agtccctgcc 

6541 aggtagtatc cgcctccggg tagccttgag aggtcacagt tccgaaggag gaatcgggcg 

6601 aaagaatgtc ccttctcact gtctccagct agggggtgag cttgggatgg cacaagaagg 

6661 aagggaaact cggggtggat gtgattcggt gctagagcgt gggccttgtg gttggttgta 

6721 ctcgctgaac ttaagattgg cagcccagat atggatgagc ctgggggacc ttatgctaaa 

6781 tgaaataagc caggctcaga aagacaaata ctgcacgctc tcactcatac gtggaagcta 

6841 aaacagttga tgtcacagaa atagagagta gttcctggag agtggggggc agcgggctgg 

6901 ggattgccaa ggctggttaa catatccaaa agtgcggcaa gatgggggaa taagcgttag 

6961 tgccctatag tgccggaaag cgcctggaat ccgtaagaat gtattgtgca ttttcaaata 

7021 gctagaagag tggactttga atgttcccaa cacaaagaaa tgataatgtt tgaggtgaca 

7081 gatatgctaa ttaccctgat gtaatcatta cacattgcat acacgtgtct aaatatcaca 

7141 ccctacccca gaaatatgta caattattac atgccaatta aaaataacag taagagcaaa 

7201 atcaaaaacc aaacaagaat gactttttaa catacatttt ttaaaaattg ccagtccaaa 

7261 tccaaatcag agggtcagca caaaggtcac taataagcgc ttttgctagg gtggtggcag 

7321 ggcccctgct gcagtgtgag tgtctccatc caggacaccc agcacagctc ggtagccttg 

7381 ctgggctcca gggaaagggg ccttgcttcc tccagggcac gtctcccagc atcagtgtca 

7441 agggagtcct gtccaagatg aactcacctg ccctcagggt cctaagcccc atgtagtggg 

7501 acggtccctt ccgatgccct ggtgggttgc ggtgacttct gcctgccctt gtttcccttt 

7561 cgtgaaactt cccctccttt aatctgagtc agaccccagg cttcaggggc aggcaggcct 

7621 ggccaatccc agcaccacgc cttcctggcc accatgaagg atcagggtga acatacaacc 

7681 aaagcgggcc aaggagtagg caaggtgtga ggtggggggc ctgcagcggc tggagccccc 

7741 ctgcactaca ggagaccccc caccaaggat taaaggagga tgagaccact gtcgagctaa 

7801 gactggggct gcagtctcat ctgaaggctg taccggaggc gggtcagctc ccaggctcat 

7861 tcaggtggtc acggcgggcc ccagctgagc tgttgcacac cataccgtat gggcccctcc 

7921 tggggccgtc acccatgtca gctttacccc acacggtcgc ccctgctcga gggacccaag 

7981 agacaacaag agaaggcacc tcccctacag aagccacagt ctgcttttga tagaatctca 

8041 gtcccgccta ccgcagggca tggacattaa gagactgggt cactggcggc catcacgaag 

8101 gctgcctgcc actgacgggt gtcatttatg cctgaagaag ttcctctcct tgtgaaagcc 

8161 aattctgagt tgtgctcttg tcacttgcag cgaagagaat ctgagccacg tgcttcctcc 

8221 tttgggggac cacagggacc ccagtcagga cactcgctgg cccatgaaag atggcgtctg 

8281 atctgggggc agctgatatg ttctgggggc agtttatatg cttctcagtg ggagaagggc 

8341 cctggccaag tctttgagga ggagtcaaga gtccagccag aaagacatgc cccaatatca 

8401 ggaccagagg caggaatgag gctagtgtgt aggacagagc tgcgggagga gccaggctga 

8461 ggcaggctgg gggtgtgcag ggaggtccgc tatcctagtg tgtgggcttg tggggagtac 
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8521 cgggaggatg ctagaaaccc aagcgcctac aggagcaagg caggtgacaa gtcaatgagg 

8581 caggctggtt gcacgaaaag taagtccatc ttgtgttttc ccagttttga aggagaaagg 

8641 gtacagagga atatttccct attagaatag agtgggggtc ggtgataagg agtggcgagg 

8701 actgtggcaa acccaggtgc ccaggcacag ctgaagggag caacatccgc ccagtgctgg 

8761 caggaacaga ggcccagagc tgctggatca gatttctcag gtcaatcgtg aaatctggat 

8821 tctacagaca cctacctaat agtaaacact ggctcaaatt gaaatcagtt atgcagacaa 

8881 aacaacgcat cctccccgtc accactttat tccagaaccc tggaataaga gccctgcagt 

8941 ggKcagcccc ttctccacgt cctgcagact ggatccactc tggccgatca cctctctcct 

9001 cccagatgct ggcttctgtg gttcagcctg ggttcccagc ccgtcttagc agccaaagtg 

9061 atcttgctgc tgcggcaaga caatttattg aaatattgat ttgtgttttt gttcttgcct 

9121 cgctttgttc tcggcccagt ccaagaaggc agcctgccac tttcgttctc caaccccaag 

9181 gaggttgtgc tgcagggtcc agagctaaag gggtctatga cccccttctg gctttggggt 

9241 tctgaggaga ctccctgagg agggggcaga ggggagggct ggggtgcatg ggcacaggga 

9301 ggggaagctg cactggggcc aggggagccc tgggaggcag gacttcagga gacatgagag 

9361 tgtgctagac cctgggagcc tgggtcccag cagggggtac cccaagaagt agaggagagg 

9421 gttacctcag tgtccgggca ccccaggact caacctgcca ctgggagctt aactgacaag 

9481 ccacagagct gcttgtgctt cctctctctc tctgccaacg tcccaacatc tccaaggtcc 

9541 cagcctatct ctgacgggca cagaggagcc caggggcaga gatcttgtcc ttgctgggtc 

9601 tggagcattc aagggctatg gggtacccat gaggcccaga cattgacctt gtgcccagag 

9661 gccagggcaa ggtctgtgtg cattcttaaa acagcccttt gggtcagtga ggaaggagtg 

9721 gggatagaag gggacagtgt ggcactgaga tgcatgcagg tgtggctggt gccacgtcag 

9781 gagggtcagg acctggtggc aggagctgag aggcatcagg gctgcagggc gggagcaatg 

9841 aggcagagac ggcaccagtg tctggctttg gggattggta gcaaggcaga agaggcggcg 

9901 cgggaagggg acggcaagtg ccagaggaag agcgtggggt ccgcagtgca ggaaacatgg 

9961 tcacggcaat gtctttcctg gccagcttgc tcgttgggaa gagctgggct gttttgcttt 

10021 gggagctctc ccacttctgg gggtgcagcc cttgtgggtg atcagtccca agccgtagct 

10081 gggacccatg ttaggcccat ggagcacagg ttcgaaagga ctgaaaactg gctgaagtgg 

10141 gtttgcctgg tggcagcacc caaaacaacc atccgcaaac tcaggctcat gcaccccaga 

10201 gccccctttc ctccacccac tcctcccact ccatgagagc tacaggcagc cttgggttct 

10261 gcatgcctct aaggactcct cacagaaaga aactgatgtc caggcagaga catcggggca 

10321 gctggacagg tctgcggccc aggagggaag tctgggctgg ccgtggacag ggcaaggccc 

10381 tcgctactgg aggcagtcgg aacccaaggg aagagccaag ctcaggtctc tccgctgaga 

10441 cctccagcca aggatactgt cctgacatgc agtgaactcc cctgtgggtc acttgggctc 

10501 cagcgggacc acctgctggg atgcgacctc ctttgctcct ctgccacctg gcctgcatgc 

10561 aggaggccag atgggtccct gctgcctggg gcaccccagc ccccaccatc ctgtgccctt 

10621 gctatgtatc aagcctggga actgtgcttc ccctgagcga gaagaagaca catttctcat 

10681 atatggatct cctccagaag tgagggaggc tgcactgctt gtttccactg aggagtcttt 

10741 caccagctcg ccctggtcat ctacgtccct cccagtcctg aatgcacctg acttccagct 

10801 gcctgctcgg ctctgcacgg ctcctctagg aaagtctgtc tgtgtgactc cacctgttcc 

10861 tagctgtgct gaggggggga ccaggagtga gccgtagcac cctgggctct gcaaaggggg 

10921 tggctgcagg accagccagg gactttacag ggcagccagt ccctcccagg ggcagccgcg 

10981 ggcacccagc ctccagagca gggggcggta gatggtgctg tatgaccggg gatggacatt 

11041 tcgaggccct ggggctgcat ggtagctgca gggtccacct cctgagtgag gcagggcctc 

11101 tgagggcaga ctcttcacag cggcagcagc tcccagaact cccaggtcca gcctagggcc 

11161 ccagaagagc caggcgggag aagcctgcag tagggctgtg gatgaagtcc aggggcccat 

11221 ccagcccagg atgcaggagg agtcctgagt gggctgaccc tagctcagag cacgggaaca 

11281 gatgagcttc caagaggctg atcggatcct ctacaggggc cccaggacag tcgtcacccc 

11341 tcatggaggg ctctgccacc cctgtttgga ggtgacatcg accactgcag acatgggctc 

114 01 ccaactgggc tccactgccc gagtgctctg gggccctgga ccaggctggc tcttccagag 

11461 tgcagggtcc tccttggcca tcatgagtgc ccacctatgt ccacatggag accctccttg 

11521 ctatgctggt tcacagagct gcccagatgc aggaggcaga cactcgtcca ggcccctggg 

11581 gccctcccac agcagcaccg ggagcctggt cccaagaagg atggtgtttc ctccgggttc 

11641 agcctgtttg gctcccctgg cccgggctgg aaagggtctg aggacatgcc tgaaagaccc 

11701 agggatccca ccatgaggtg gagagggcag atctgagggc cttagggagg caagagcttg 

11761 atggccctca ccttgggtca gaaggtcact ttgctgaatg ctccccaggg ctgtcaggag 

11821 gcttggccag ggcaagggtg aggaaacact gcgggcccct gaacctcagg cccagcctgc 

11881 aggggtctgg gtaaccccac gggcctcagg gaacagccag gagcttaggg ggcccgtggg 

11941 ccttgggtgg acgagtaggc tgcgtccagc tccaagtccc agccccatcc tggccgcggg 

12001 accctggctc tgagctgccc gccgggtccc tcctcctgcg tgcgatcggt cacctgccag 

12061 atggggcacc tactggggat gtgcctgttg ctgtcgctgg cctctgagtc actaaagcct 

12121 aagctccttg agggccgggt ggttgctggg cccgagtctg tgccccagcc ctggcccacg 

12181 cctgccctca gtagccccac tcctcccttc acggagcaat cccggcttct gctctccatg 

12241 ccaggatgag gcagcgaacg cggccagagg accccggctg ctgggcgctc agtgatgaag 

123 01 gaatgaacac gagcaccctc aggagccagg ctgggcaggg agggaggcgc cggctgcggg 
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12361 gacgttctcg ggaggcctgg ggaagtgagc ccaaagggcc gccccggcag atgctgaccc 

12421 cagcaggaag tcaccgtccg caaggagcgc cccgctaggc tcaggcttgg cacacgacac 

12481 ctcgcttcca cttcattttc ctaccttccg gatgacgaac tgcgtctcag acaactgaaa 

12541 ggactcgcct gagtcccaca gccggaaaga ggtcggggtg ctgccccccg cgcggggcgg 

12601 gcagaggaag ggccgggccc gagcggccac caggtggcgc tgctgccgcg agaacagcgg 

12661 cccgggccgg ggacgcaccc ggcgcccaga ctccgcggcc gcacccgggc tcccggggcc 

12721 ccagcggccc caagacagcc tctgggcctc accctaaata catcaattat gacctgagca 

12781 aaaagttctg ttccaatcgt tgctctccga catttccatt gtttgctctg gaaggagctg 

12841 aattatctat gctctcccct ccttcagtca aggagacgtg ggaggaaacc gggatgggga 

12901 tcagcagcct ggaggccgcc tccgctggaa ccgcgcccgc cccgccagga ggctctgcct 

12961 gctgggcaag gcgcactcag ccagccccgc aggccgccac gcgaccgagg ctcaccaagt 

13021 ccactgcagg tggcaggtga ggtcacctgg gcagctctag gcccggccga aatatctcat 

13081 ctgctctgtt cttgggcaag ttaccttact tgtccccacc tgcaaaatgg gcacaactgc 

13141 agctccgagc ttggagctgg ggggcttccg cccatgacag ttcaggcagg gtgcagtcag 

13201 cgcccacaaa tgtcccttgt ccttgtctct gccgtgctgt tgatactgat ggagtataac 

13261 accagtccct ggggactggc ctacccgctg gggctggctc atgcccccaa ggggatgcgg 

13321 cttcagcaaa gcttctcaac cgtccccatc ccagaggggc cctcctgcct cagaccagcc 

13381 tcctaaacag gtgcaggata ggacagcagg gaggttgggt tctggtgagc agaccacatt 

13441 tctcaacctc agcactccag atatttgggg ccagataatt cctgttgggt gaggcgggca 

13501 ccctgttcat ggcaggatgt tgagcacacc cctgggcact ccttgccaga acccagtgga 

13561 accctccccc tagtcgtgac aaccaaaagt ctctccagac actgccaaat gtcccccggg 

13621 tggggggagg ggtggcaaag tcactcttgg ttgggagcaa ctgaccctca ggggaccagg 

13681 acaaggagcc tgaagctggt gctcctgacc agccgggatc acagagtcca gcccctcctc 

13741 ccagagcaga gtggctccac cagcgtgtgg ccccgggcag gccgctcgcc ccctggagcc 

13801 agggtcctca ctgtcaaatg gggatgatca cagcgcctac tccacagggt tgctgtgggg 

13861 gtgagggcat aaagtcacat aaagggcttc agacggtgcc tgtcccagaa gaagctctac 

13921 aaacagtcac ttgcacgaac gcctttatga ttgtcgtcat tcctgttgtg tgctggggtt 

13981 tcacagacct cctcttatgg gggaaactaa ggccccatga tgttaaattg ctcaccaaag 

14 041 gtcccacgag tatggacaaa gccagaatga atggcaggga caaggtccca ccatccctgc 

14101 tctgtgagca catgttctac cgtgcccccc agctgagtgc aaagtgggct cccccaaggc 

14161 tggaggtctc ttgttccagg cagggcttac agatcaaatc tcatcacgac agagaaccac 

14221 gagagctggg gcaggaggcg tgcacctcat ttctctgcca gaactcccat atcgttccag 

14281 cattttgttt ccagccatct tccccaaagc ctgggaagct attcccagag agagccaagc 

14341 tgccagaaca ggattctcaa cttgagcctt cccaggcatg agccagtgca ggcctgtggg 

14401 gaggcagaag aggggcctcc tgctagaatc tgaatgttgt gtcccccaca tttgaaaggt 

14461 tgaaagctga gaaccaatgc tatggtatta ggaagtgtgg tctttgagag gtcattaggt 

14521 catgagggtg gagtctccat aaatgagatt agcaccctta caaaagaagc ctgagggagc 

14581 ctgCtcaccc ttccatcatg cgagggcaca gctagaaggt gccctctagg aggaatgggt 

14641 ccttacgaca catcagttct gctggcgtcc tgctcttgga cttcccagcc cccagaacag 

14701 tgagtaacac aattccattg gttataaatt actcagtcta aggtatttct gttattgcag 

14761 cccaaacaga ctaaggcacc tcccaagggc agtgtccagc cttagtgcct cctagttctc 

14821 tggacccagg taggtctggg tgaggaggtg gatggaggat gatgtgggtg ggccatcatg 

14881 ccttgcaccc ctatcaggtt gggggttggg acccagatgc tcaggaggac agggggctgt 

14 941 ccctggggtg aggcatgggg tgtgtttccc acctcatccc ctgtcccaga tccccagggc 

15001 ccttgatata tcctccccag accttcactt cctggccttc ctgcaccatg gctgggccac 

15061 tctaaatcct ctaccctata ggtgggtggg tgggcacacc aagcccccac ccccagcacc 

15121 tctcttctcg tgctttccac taagagactc acatgcagaa tttctcagac ctttcacctc 

15181 caatgctgaa aaattccagc aagcattcca cactttgtct tgacctggtt ttttataaca 

15241 agacacttta tgtccttctc tgatgctccc caagtctctg atttccattt gctttgattt 

15301 tcatcatcct aggaaatatt taaaatgaga acagttgagt agctttggta ccttctcaaa 

15361 aaatcaattg aaaagccagt tgctatgcaa cctcaagtgt tgccgatgct ctgctccctg 

15421 gctggtggta tattgtataa aataagatca tctcttttct tctgcttttt tttaagaaat 

15481 ggagaaaaat gttgttttag gaaaaacttg agataactca gaggcttctc ataagctaga 

15541 agacaaacat gtcactcaaa caagtgctac caattttatc cccactggag cctggtgagg 

15601 cctgggggac aggcaatggg ggagcggcag gcggggagct ggtctatttc tgtcacaatt 

15661 tgggatgaat aacaggaacc gagtatctgg gccactttct gcccaagtct gggtcatggt 

15721 ctcacatccc ccagagtctg tccccaccgt gggggggaca gctcagcccc ctcctgactg 

15781 ccatcgcctt ggttttgcca tgctaaaatt cttctagaga cgggggcctc attccttgac 

15841 gggaagcctg tggttccttg aagagcagct catcagaatt catccccacc atgtgccaaa 

15901 ctttgtcccc agagccccat cgagaatccc tggctgtgct tctctgtgag caaaaggggt 

15961 ctgttccctt ggacacagga gctaggcctt ttgcccccag atcttggctg ggaagtcccc 

16021 atcctggctg ctccatccat ggtcccctat tgcctgccac cctgctgtat tccttgccgg 

16081 tgctcaccac tctgtcgctg tcttgtctgc tgctactgcc tggacacagg gctcctctcc 

16141 cctcctcgtt catgcacctc cctgggatta aacagcagct gctcgggaag ccagaggctc 
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16201 


tgggctggcc 


ggatgctttg 


ggggcacaag 


gacacatctc 


ctcccacgtt 


cafccagacac 




ctcagagacc 


tgaaaacagg 


gcctccccca 


gtgctatgca 


aagtgaagtc 


tgtgcctggt 


16321 


ggacacaaac 


tgttccagtc 


cacagtgaga 


taagtacaga 


aactagaaac 


ttttatagca 


16381 


attggacggg 


ttgagtttat 


gtccacagaa 


tttaataata 


aaaatccaga 


cctagagttt 


16441 


t tggatacct 


cggtttttgt 


cacatttgct 


tttcccagta 


attcatttat 


aacgtatggt 


16501 


acggaagtat 


tggtctgtga 


tgggtttgaa 


ataggaaaaa 


gaaaacagcc 


aggtctttgg 


16561 


cattgaggtg 


cactgtgtac 


tgctgctccc 


aggatggcct 


ctcccctggc 


cacctgggga 


16621 


tggcagatgg 


ctcctccggt 


cccacagacg 


ctgctcccaa 


gccctgtggc 


ccacagcaca 


16681 


ttcgctaagt 


cagccagggc 


ccatgaccag 


aacccatttt 


cctagccgct 


gtgccctcta 


16741 


ccccaagcct 


tcagcccagc 


ccctagcagt 


gccacctcct 


ctgctctact 


atttcctgca 


16801 


cttttgcagt 


aggtccaacc 


taaagcaacg 


aggactcata 


gcctcggctc 


acttcccctc 


16861 


tcacagccca 


gaacccccca 


gagaagggca 


cagcagcggg 


caggcccggc 


tcttctggca 


16921 


gtctacactc 


agctgatcaa 


ctcaggagat 


ccccgggatg 


gagcggaccc 


catgggcact 


16981 


ggcgtcccct 


acccccatca 


ctgcctcctg 


tgcttgatct 


gggaggcttc 


taggaggaag 


17041 


gcggggtagg 


gccaggcaga 


aaaaggcggg 


cagagtatgg 


ggtgagcttc 


gctgagcccc 


17101 


actctgggca 


ggcatgagtt 


aggagcagaa 


aaaaaggccc 


agaatgctgg 


aatgttccag 


17161 


aacattccag 


gaggccaatc 


atccctgaat 


tcccggggtc 


tgacagaggg 


accggctgtg 


17221 


tgttgtcctc 


caagctaggc 


tgaggggcct 


gggaacaggc 


actggggtgt 


cctgggtccc 


17281 


acacccctca 


caacccagca 


ctgcctctcc 


cctgcctcct 


cagctgcggg 


tccactgagc 


17341 


cctcggggcc 


gggggtaagc 


tggttccatc 


tggagttagc 


tctggaagcg 


tgcgtccagc 


17401 


Ctccggttct 


gtcttccaat 


ccctcaccac 


catcagttct 


caagcccctt 


cccagcagga 


17461 


ggcggaagca 


aggccagcct 


cagttcacct 


atgtcctctc 


tgttctcttt 


gtacctgggt 


17521 


cgcagaagtt 


gcccggtgcc 


caccacaggc 


atgagctgcc 


ctggtttgct 


ggctatggat 


17581 


ctcagcaccc 


ccagggcctg 


cacatgtccc 


catgttgtcc 


ctgggaagag 


caggctgcac 


17641 


ctgacaaccg 


gtcccctgcc 


cactgcccca 


ttcccaactc 


tgccaaaagt 


ggaagtgggg 


17701 


ccccagcatg 


gccccagtga 


agcagagctt 


ctgcaggtgc 


atgaggggtg 


ggggtcccag 


17761 


caaagcagaa 


cttctgcagg 


caaggcgcat 


gaggggcagg 


ggccaaggag 


gccctggcca 


17821 


ggagggtggt 


gcctgctaac 


ccaggggcgt 


gggcttctca 


tgcaggggag 


accacgtacc 


17881 


tatggcagcc 


tgcgtcctca 


ctccaagatt 


tcccatttcc 


ttcaggtgga 


cccgctgcag 


17941 


gtggaccccc 


tgccccagcc 


acaggaaRcg 


tgagctagcg 


cccatagggt 


ggaacacaga 


18001 


aacctcatta 


ctcctcccca 


gcccactgca 


gagatgacgt 


gccaagggaa 


agcatgaatt 


18061 


ctctttgatg 


tcttcattga 


aatcagaaat 


aaaggactga 


gcgtgacaga 


gccctgtggc 


18121 


atgccaccgg 


agacctttct 


ccagagtgct 


gaattcagtc 


aaacggcatc 


atttggatcc 


18181 


taccattttt 


aaatgatttt 


ttaaaaaatc 


atttaaaaaa 


tttaaatttc 


acaaatttaa 


18241 


aatgcagtgt 


tttgtttata 


aaagaaacat 


gctggtccag 


gctcagctgc 


tcacacctgt 


18301 


aatcccagtg 


ctatgggagg 


cagaggcagg 


agaatcattt 


gagcctggga 


gttcaagacc 


18361 


agcctgggca 


acagagtgag 


accctgtctc 


tacaaaaaaa 


aaaagaaaaa 


aaaattagtc 


18421 


gagtgtggtg 


gtgcgtgcct 


gtagtccaag 


ctacttggga 


ggccgagatg 


agaggatccc 


18481 


ttgagcccaa 


gagatggagg 


ctgcagtgag 


ttatgatcac 


accactccac 


tctagtctgg 


18541 


gcaacagagc 


tagaccccat 


ctcttaataa 


taataataat 


aataataata 


ataataataa 


18601 


aaaccatgct 


catcacattg 


aatttaaaaa 


attcagtaaa 


gtacaaagaa 


gaaaatgaaa 


18661 


attaattagc 


ctactgccca 


cagagaaatc 


ctgttcacat 


ttgatgtgtt 


ccttccagtc 


18721 


tttttggtaa 


atatatttgt 


acaccaaatt 


gagatcatac 


ggaaaactga 


gatcctgctt 


18781 


ttcaaacaac 


aaacagcttt 


agactgtaac 


catttcccaa 


tgccattaaa 


catgtttcga 


18841 


aaatgcaatt 


ttgcggcagc 


ctaatactct 


accacatagg 


tttatcatta 


cttaactaga 


18901 


ccactattat 


tggacaatat 


aggttgtttg 


ctgttgttca 


catgagagat 


aatgccatga 


18961 


cccttgtctt 


gttcctaaaa 


cttggcccac 


agctgggatc 


actgctttgg 


aacagagttg 


19021 


tagaaatgaa 


tgtctggggt 


cacagaaatg 


aatgtctggg 


ggcaaacatc 


aagaacattt 


19081 


taaaggccaa 


atgcggtggc 


tcgcgcccca 


taaccccagc 


actttgggag 


gccgaggcgg 


19141 


atcacctgag 


gtcaggagtt 


caagaccagc 


ctggccaacc 


tggtgaaacc 


ccatctctac 


19201 


taaaaataca 


aaaaattagc 


tgggtgtggt 


ggcaggtgcc 


tgtaatccca 


gctacttggg 


19261 


aggctgaggc 


aggagaatca 


cttgaaccca 


ggtggcagag 


gttgcagtga 


gctgaggtcg 


19321 


cgccactgca 


ctccagcctg 


ggcaacagag 


caagactcca 


tctcaaaaaa 


aaaaaaaaaa 




agaaagaaaa 


aaagaacatt 


tctaagggca 


gtgataaata 


ttgctacatt 


tccttagaat 


19441 














19501 


cctaacaggg 


ttgatgcttt 


ataaatggtg 


tattatttat 


ttctttaaca 


gtagagttag 


19561 




atatgggctt 




acttggattt 


ttttctttgt 


ttccttccat 


19621 




gtttccttat 


gatttagagg 


aggttttata 




tatttatcac 


19681 


attgcaaatg 


tttttcctgc 


ttgtcatttg 


ccttttcatt 


gtgttttttg 


acagaagttt 


19741 


ggtactatac 


ttatccaaat 


atatccatat 


tcccctttac 






19801 


tgcttagaaa 


gctcttgaca 


accaatattt 


tcttcttgat 


ttgtgtgatt 


taaatagttg 


19861 


aatgttgtat 


tggcttatat 


ttgtcaYatg 


ataatttaat 


ttcctattta 


ttaaatttat 


19921 


gttatatatt 


tatttataac 


acaataagca 


ttctgtgttt 


atcacggtag 


gttatttgta 


19981 


agaggggtat 


tttcaactat 


tttattcaac 


tgaactgtct 


atgaaagttt 


atgcacatat 
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20041 ggcactattt taattatctt cactttacaa gatgtttgga ggaagtctct tacttgcttg 

20101 ggtcccacag ctgtctcggc cattcttgac cgttcatgtt cctttggagg aagtctcttc 

20161 cttgcttggg ttccgcagct gtctcagcca ttcttgacca ttcatgttcc tttggaggaa 

20221 gtctcttcct tgcttgggtt ccgcagctgt ctcagccatt cttgaccatt catgttcctt 

20281 tggaggaagt ctcttccttg cttgggctcc gcagctgtct cagccattct tgaccgttca 

20341 tgttcctttg gaggaagtct cttccttgct tgggttccgc agctgtctca gccattcttg 

20401 accgttcatg ttcctttgga ggaagtctct tccttgcttg ggttccgcag ctgtctcagc 

20461 cattcttgac tgttcatgtt cctttggagg aagtctcttc cttgcttggg ttccgcagct 

20521 gtctcagcca ttcttgaccg tccatgttgc tttggaggaa gtctcttcct tgcttgggtt 

20581 ccgcagctgt ctcagccatg cttgaccgtt catgttcctt tggaggatgt ctcttccttg 

20641 cttgggttcc gcagctgtct cagccattct tgaccgttcg tgttcctaga tggtgcttag 

20701 aatcacttca taacatttcc aactatgaat ctgggaaagg ccaacctctc gattataccc 

20761 agcttcccat gaggacagca tttgtgtctt catttgtgat cactttcttc ctatcattaa 

20821 tcaggtacaa tttccttcca atgggtccat tgtcttgtca ggactattcc aggtattcta 

20881 catttttggt tactaccata atggaaattt tctctccctg ttttccccac taccaaggct 

20941 gttgtttttg gttcacacag aagctactga ttttttaaac aatggtctga tatttggcca 

21001 catcaccaga ctcctctcct aactctacaa gttRtaccgt ggattctttt aggttttccc 

21061 gacacatgat taacagcaaa caatRagaat ttgtctctgc tcttttccaa tactagtttc 

21121 atgtcacata caagtatcta ggattttcag aaccacagta actagtttat gggtcttttt 

21181 tgtttgtttg tttccaagac tatctccagg gcagtgctcc ccagagcagg ttcagcaggc 

21241 tgcggatctg ctgagcaaga agcttccgtg gagaaagttt tcagagacca cctgtactgc 

21301 tcggactctc ggaggtacac catgcacagc tccacaggaa agtctccggg aaatttcacc 

21361 cccaggaatg cccattttcc tttgtttgac cagcattttc cagacttctt tgaccttggg 

21421 acccctctat agtcctctgt aacatctact catatcctaa gaattcgctg tgcagaacac 

21481 ttttagggaa atgtgcttct ttttccgcct ttaaatacaa tcttggttgt cctgttgccc 

21541 tctgttacta gttttctggg gatctggaaa aggaggagtg ccctgtgagg ggctaYgggg 

21601 ccgtcacgcc cttgccctgc cagaggcttc aacactggag ccctgaggcc tctctcccag 

21661 gctcccccta ggcagcaagc tcctcatgcc tgcagttgct tctgctaccc ccagctccag 

21721 accccgcagg gctccagacc cacatccaca tcttcttggc ctcctccgct gggatggcac 

21781 caggtacctc taactgcaca tctcagctga cctcttcagc ttccctcctg tgctccccaa 

21841 tctcaggggt tgataccatc atctacccag ctgagagtct ctccgccttc acccacctag 

21901 tgccacatcc tctcgaagtg gcctcctctg tcctcctgtc tgtatccctg gacccaggcc 

21961 tggtcccagt ccagctgcca gcatccccca tttgatggca gtgactgtct cctaagccac 

22021 tgctcccacc tgccccttct gcacacagga gccagtgatc ttcttagaaa tgcacatggg 

22081 atcgtgtcac tgcttaaagc tgccctcatg ggcaagccaa accccttggc gggagacgca 

22141 ggtctgctct gctccttgcg atcggctcct tccgacgtct ttggcattgg gtctgctgtt 

22201 cctgtccctg tgctcgtccc tctgggttca gctgccctcc ctcccctccc agcctatgaa 

22261 ctcagagcat ctctatgctg gtgacttcac tggaccatct tcaggtgtgg gtggaacggt 

22321 cacttcctcc cgttgctttt cctgaccttg taaggaaaga ctgcctgacc cgcgtctcct 

22381 ccatccagcc ctgagccctg gtacaattgt ttccttgtct agatctgcct tctggaagcg 

22441 gtcccctgag gacagagact atgccagtga atattcccag cacagcactt cccacacaga 

22501 agcccttcaa acagtatttg ttagaggatg agttttgtca tttggaaact gatgtgatga 

22561 actgtatgaa tagatttcct gatattaaac aatctccaca ttcctaggat aagccctaga 

22621 tgatcacggt agaagtttct ctttatacat tattcaatta aactcttaat gccttgggat 

22681 ttttacttct atttttagaa gtgagacagg tctgtacttt gtcctgacgg gctacctttg 

22741 gcatctttgg gtataagggt cccattgact acgaaagaca cattgggtag atttcagaat 

22801 acatgagttt acagggtatt gtattagtct tgcctgataa atgcttgcaa atgagtatga 

22861 atgcactggc aaaactttgg aaaaggtgaa gaacttgagt tccctccatg gttatgggac 

22921 ttatggcata ttctccattt tttggagtca gctttgaaat attcacatcc caggaaatcc 

22981 tcaatttccc caggatctcc gtgtccctgg catagagatc ctcaaattct caccttcagg 

23041 gtttgactct cttctgcaaa tgttctgact gtgatcacat actccttcta cctacctgag 

23101 gcagttaagg aaaaaaaaag aaaaagaaaa agaaaaggct tagggtcggg ataatttagc 

23161 atagagcctg ctaaggagta attaataaag gaggctttat tccatcaagt actcaagact 

23221 gggaaagagc agacactttt caggccaaat aagatccaaa gctttgccag cactggtaag 

23281 caattaaccc tagtcaatta caaaattctt ctttttaaac aatgtaggag caacctgggg 

23341 gaaaatgcaa gccgccacct ggcacccaga ccaatcaaKg ggataatctt ttctcccctt 

23401 tctagcccca ggaccaggag tttcccattt tKggaaactt ccaaaagagg atataactct 

23461 gtccatctct catgccaaag gcttgagaat tcctaaaaca ccagaggatc ctggccagga 

23521 agggacaggc tggaaagacc cggcttgggg aaggccggaa cagggaactc cactggaggc 

23581 aggcagccag ccacatggga tagtccaaag atcccccccg cacctccccg cctgtgaatg 

23641 gagagctgga accggattgc tgggctgggg cccaccccca ggctctgcat ctgtgtggtc 

23701 caaagccccc tgccacaaac ctgggataga gggctcagcg agcctctctc tgccctccta 

23761 gccagcattc actggcagag gatgccctgc acatccctcc tccactgatc atccctggac 

23821 ctccgaagtg gccggctgca gagagcctcc caacagcttc ctagtgggca gatggccctg 
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t^attttt 3 ^ 


C ttttttt CC 


24421 


caaata tt 


9 ?ttcacca 9 




cacacccggc 


9 


99 


24481 


ac 




t 

9 a9 ^ 999 


c^gg c caa 


ac cc gacc 


caag ga c 


24541 


tacccacctt 


cctcccaa 


t 9 t°tttt a 


99 9 


9a ?^? a ^ a 


gcctggcccc 


24601 


ta ta ttt 
ttct^t cc 


tttttttttt 


ctttt ° 


aaa gaa g 


ga a 


gg tta aaa 


24661 




cccaccccta 


t tttt^ttt 


ca cage 


gaaga ggac 




24721 


ta ct tata 
t^tctat a 


aaattcacaa 


9 r i- 


t ttt°tttt 


a ^ ac ^ 9 


aatttttaag 


24781 




ttcctc aaa 


agaagg ca 




ctg agg 


geaagggaaa 


24841 


t t 


actactaatt 


a9 ttttaaa 


tt t 
Ca 9 9 Ca 


caaaaa 


c a gee a 


24901 


tta catcta 


ttt 
cccctt^tt 




agtgtattag 


a9 f^ ° Ca j| 


a ? t-f" 


24961 


tt ccaattt 


9 


ccaagga 


gttttgggat 


9 9 9 


c c 


25021 


acac aa c 




9 


gtgggtgcct 






25081 


cgcccggcgg 


qtqtaqqqaa 


^ a ^ ttta aa 


tc 9 


t tt t 

tataa^t? 9 ? 


atcccttct 9 


25141 




ccttcqtcat 






9 9 


ttt 
a gcag g 


25201 


attttatttt 


ttqaqactqa 


gat tc tct tt 


a tt 
g acagag 


caac a 


a aa a c 


25261 


tctqct t tct 


atcctattat 


gatttgattt 


9 


gccagaccac 


a99 °^ 9aat 


25321 




ctaa 3 tcaca 


ttaa^ctc 


cgggtgctgg 


a ggece gg 


c gc acct 


25381 


gggcatgtca 






gtgttcttat 


c g gaaa g 


ggcagagtca 


25441 


cagtgtgcac 


ccctccatct 


t 

cttt ca 393 


aagggca 


ca cca ac 


aaagtgcttc 


25501 


tcctct^tct 


^aca^atc 




.acacgggcc 


g ac ^g ggg 


gctcctctct 


25561 


cc e g c 


gcacagca c 


g g gec 


a cccagg c 


ac gtccctg 


gcggctacac 


25621 


gcac 


accgccagga 


aaggacg cc 


caggtttggc 


tgcaagagga 


agtgggacgg 


25681 


gcaagagagc 


ccagc ggga 


cagcagccga 


ggccagagga 


9a ?? a ? 9C ? a 


cagatttaat 


25741 


aegggggcag 


t^actct 399 


gagggtegtg 


ttaaccacgt 


g c ga g 


ct tgacatc 


25801 


°aaacaactc 


atccat a 9 c 


gggactgccc 


cgc aag gg 


aaccaa c 


agagacag 


25861 


gaaacddc c 




acatttttca 


ga acaaacc 


aaccaatcca 


gcgcccatgt 


25921 


9 


ctcctttatc 
c a c 


agac caca 


c c gageca 


c a ccccc 


gccctcatta 


25981 


tt C 


aac cagga 


cege ccac 


ccccagagcc 


cac ggaa 


a caaac g 


26041 




a ?ttaccca a 


ccc g cccc 


a egg cc 


cc gcagaaa 


caacatggac 


26101 


t 

gg g 


9 t aC t C t a 


C 9 ° 




cccc gg gc 


ctccctgtgt 


26161 


ggcccccg g 


gec gg a g 


c c cc c 


ggga c g 


ggag aacag 


ac c ca 


26221 


tcttata & a 


g aa ggeca 


ca aaaa 


caaggggtga 


aagggagcag 


ggwtcttgtt 


26281 


tctccaa"a 


acc cagggg 


agecagggag 


aggcagggtg 


g ctggcaa 


acgcacacac 


26341 


c ccaagga 


ggecca gec 


cac cccag 


gcccctcact 


ccgagggggt 


ggaggcaggc 


26401 


ac^tt at^t 


acag gcac 


CaCC ^ Ca 


age ca 99 a 


ggatgatccg 


caccaggacc 


26461 




gggcacccaa 


ggac eg ca 


gg aga 


Ca t- 9aCC 9 


aaagaaacag 


26521 


qqa catca 

gg gg 


tccttct^t 


caggccc ag 


gac ggc c 


ggc c geca 


a ggga gac 


26581 


cacctt^ 9 ^ 




ccca gc 


ccacccagg 


eg caeca c 


aac aggagc 


26641 




cccctccctc 


a gccccc 


gece c gee 


c ccaggc 


9CCa ^ aCCa 


26701 


cctqct a 


t 


ccc cc ga 


gcaaccccac 


ccaccagccc 


aggg cc 


26761 


cacctqqqqc 


t 

cacaacct? 9 


aa ^ 9 9 " ca 


gggaaagcag 


c aege 


9 ° 9 a 9 cc 


26821 


ccaqct 
„„ 9 9 „ 




cc caacag 


gaacg ggg 


ga ggaccc 


ccgcggcaca 


26881 


ggcagggctg 


ggaccggccc 


ggcagaag 


9 


g gg 


gggtgtgggg 


26941 






c ggaaa g 


ggggcggttc 


aggcccacc 


accaggtata 


27001 


atgttttaac 


qt tt t 
g g aa aa 










27061 


ctgttttgat 


ggctggagct 


cctgcagtcg 


ccgtctctgt 


atttatcaca 


atcatattct 


27121 


tacacgtgtt 


tcattcttca 


catttaaaac 


cattattctg 


ageaggtgee 


aatgggcttc 


27181 


agcagatatc 


aaagggaccc 


gtggcacaaa 


aacagggggg 


tgaactcccc 


ttctacagaa 


27241 


cacagccagc 


aagaccctcc 


agaaacatag 


gtcaggttat 


cccgggctcg 


aggctgtcag 


27301 


gggcttcctt 


cgcactgagc 




cagcttccca 


ccgcacacac 


ccagggccct 


27361 


ccctagcgca 


gcccccttcc 




tttgttggtt 




Yactcctcct 


27421 


cgaggccagc 


tcagcagtca 


cttccccctg 


gaaggccccc 


cgcaggcctg 




27481 


catcaagcac 


aaggtgtgtt 


atttctggag 


ccctctgtgt 


gcccctgtgg 


gctgggcagc 


27541 


cacctgcatc 


ctccctgctt 


gcatgcccag 


agecaggcta 


agcagcacct 


ggcaggcact 


27601 


cagggatacc 


cgctgaaagg 


ctgaagecat 


tggcaccatc 


aagaagtgga 


gaaacaggga 


27661 


ctggctgtgt 


gcagaccgag 


cccaggggct 


gctgacagct 


ccggacatgK 


ccaagacata 
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27721 cttcccggag cctgctgcca cgtcaggcca gcacacactc cctgcactcc aatctgtaga 

27781 gcctgcatct gagcaacagt gattgctacc tcaaaacagt gggaaggcac ctcaggaaac 

27841 cYgagggttc acaaggagaa tgacatcatg atggatgggt ggacaggcat acagggcgtg 

27901 caagacaagg atcccgaggc ccgctggcat cggtgctggg aacaaagatg gcaccgatcg 

27961 tggggcagag ttcatgacat gtggaccctg ctgagggtac catccagccc aaacgcagct 

28021 cctcttggca actgctgccc agcaaatcta catctgattg taaatgggct ccccacaact 

28081 cccacctctc tttcctatag gcaagggaca actcatcaga cccacgggag cgtggctggt 

28141 caccgtggct ccgggctgta ggcgtctggg gagggtgggt cagcaacccc tgtaggcgct 

28201 gagggttatc ctctagaccc ccagtgaccc atctcgcccc tgtggctgac aagatggagc 

28261 tccgaggagg gcacccaggt ccccaggaag ccttgccatt tggtcatcgc cctccgtttc 

28321 ctctgtaaaa tgaggattgc agtcccccca tcacaaagct gttgtgggga tgaaaccgYc 

28381 atgcaggtgc aatggaatgg tgcttgtgca aacaatgaaa cggtttttca ttgctggtaa 

28441 accatagtgc cctgcagctc cctgtgaggt cctgggggta gggtaCgcac ccctaaacca 

28501 cagaagtcct tgcccagctt tgagggtcag actagattgt ttagtccacg ctggcacagt 

28561 agccttaaaa gccagttttt cttaagagag aaggggtttc tgtaattgtc cacagtattg 

28621 ctttctctat cacgacaaag cttggccagc ttggctcatg gagtctgacc gagggaaggt 

28681 cccactgcag ctgaagggag cagaacagga gagaggagaa aggccaggga tgaggggcca 

28741 ggcttctcct catcctcaac cccaacctca caggtcatac agaacagtcc tgacatctga 

28801 gaaggaagga aatgcaaaag agcttgatcc tggctgctgc ctgggacctg agaggggaca 

28861 ggttcctagc agtaggagtg gcttccaggg cccaactggt ggcccagcct gctcaggcct 

28921 gagagggtgg cccagctccc agctgtgtgt agcagatgcc agggccccct gcaccccaga 

28981 gacacatggg caagatagtt ggggactggc ttgcaggttt acctggctct agggggcacc 

29041 accaggcttc agggcaagcc caggggggca ctggcctaca ggagaggagg gcaggcctgc 

29101 ttgcaaggct actgcagagg ggactcatcc ctactgcccc ccaccaggcc tctgcctgca 

29161 aatggggctt ttgggaggta attaggggta gatgagttca tgggggtggg accctcatga 

29221 tagggttagt gtcctatagg aacagacacc agagagcctg catcctctct ctctgccgtg 

292 81 ggaggaccca gcaagaaggc ggctgtctgc aaaccaggaa gagaaccctc actggggaat 

29341 gacccagctg gcaacatgat cttgaaattc ccagtcttca gaactgtggg aaatgaattc 

29401 ctgtgattta aacccctggg tctatggcat tttgttatac cagcccaagc ggaatgagac 

29461 aaaggcttta tggggaagag actgagccac ctcccacagg ccagactgca gccttccttc 

29521 tatcctgggg ctgagtgccc gagatgaggg cagcaccaga tataggaacg aaagaaagcc 

29581 ctggccaggc gcagtggctc acRcctgcaa tcccagcact ttgggaggcc aaggcggtgg 

29641 atcacctgaa gtcagaagtt caagaccagc ctggccaaca cggtgaaacc ctctctctac 

29701 taaaaataca aaattagcca ggcatggtgg cgagcgcttg taatcccagc tgctcaggag 

29761 gctgaggcag gagaatcact tgaacccagg aggcggaggt tgcagtgagc cgagatcgcg 

29821 tcattgcact ccagcctggg agacagagca agactctgtc taaaaaaaaa aaaaaaaaaa 

29881 aaagaatgaa agaaagttgc atttttcccc acgtttgagc ctaagcgtgt aaatgttgac 

29941 ctcaccgtcc acacagaccc tcagcacagc cacgggcccc tcacaggctc tcaatgggac 

30001 ttctcaaact aaactgaccc tggcctcttg tgcagcccac cacaccctgg gacccagaat 

30061 ggcctgggcg agcctgacct ggtctctcct tccccactgg gctgaggctg gcacccagca 

30121 gacccagcag aaccactgcc tcaccatgaa ttgtggctct gagtagatca gagccagccc 

30181 tcctgccagg aggcggagag ggacactcag atgctacgag accagctcca ggatcaccgg 

30241 cctcgaggga aaggcaggga accagaggat atattagtgg gaatctgccc actggggcct 

30301 ggaaggacgt ggcatgtcat gagacctgag aaacagaaca gggtagctca gcaaagcagg 

30361 gatggcgctc ctggcagagg aggcagccca tgcaaacctc ccagggttgg aagcagcgag 

30421 gcaccttcca agcacagagg ggatgtggaa gggatgcccc agcagaggag gtgggagggg 

30481 agcactgagg ccagaccacg aagtcaggag aaggattcca aatcctctct aagagcagtg 

30541 ggaggacatg gaggaaggat gatggcgggg catttaagaa gcttactcca ggtgccttac 

30601 ggaaagtaga tggaaggggt caagacgtgg gaatgtgggg ttcagctttg gggatgagtg 

3 0661 cagagaaggg acggatgtga gggatgatgt ggattggaat aggggtcatc tgccccgcct 

30721 gagcagggag aggctggggc tctggaaggg cattccggca gctcccacag ggcggaggga 

30781 ggagatgcgg ggctggccct ctcctggctc ctggccaggc cctgcccctc aggacgcaag 

30841 atcttcccac tctcctagcc ctacccctgg ggccgaaggt ggcagctctc atctcaagaa 

30901 ggaatcttgt gtccttctga acattgagca gatctaatcc tgctccctga tttgggacag 

30961 gaggtgacca accactccaa aagccccgtc gctgggcatg gaactccctg ggatagaccg 

31021 tcagcctcac attctgcttg gaggtggggt cgaggcagct gagccctggg gtgtcaggct 

31081 gctaggcgtg ggcccctggc tggggagctc cactgtgggc tgtccgccag cagccagctc 

31141 ccacctcctg cgctgggtgc cctgaccctg gcctctggaa gcctcagatg gttcggagat 

31201 taactggggc tccctggctg ctgaggagct aSagagggca tgcctgcaaa tgctcccagc 

31261 aggtagccag ggagaaggag gccaggccgc agccaaggcc ccctcagccc tccctccctg 

31321 gtagccttca actgtggtgc gctctccctg atcctctccc caggaaacct ctgtcccttc 

31381 ctcccttcca caccagcaca gcacatagtt ccccaggcat gcccactctc tgggaggcag 

31441 gcctgagcaa ggccgtggtg gggcttgcga tggcctcctg gctcctctgg gtggctgctg 

31501 ggcYtgggat cctgctccgg agccctcaac ttggccccct cccaggccag gtctcagcag 
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31561 


cccctcccag 


cagtcaggaa 


ccccatgggg 


cactgtggcc 


cagagaccac 


tgaccactgg 


31621 


cctgggtggt 


tctcctgccc 


ctgcgtgtgc 


tatggcaaag 


tccaacccac 


agtccggggc 


31681 


cgggctcctt 


ctgctgtcca 


gcctctcagt 


ccccgtaggt 


taatatgata 


catcaggaac 


31741 


agacatgggg 


ggcaggcagg 


gtttgtgggg 


catgtgagag gcaggggtgc 


tccacagtgt 


31801 


ccttgtcctc 


cgtgggcctg 


aggagtgagc 


cccgtcctgc 


cccctctacc 


atcccagggc 


31861 


tccacctgct 


tttcaggagg 


ctgcaaaatt 


gccaaaattt 


cctgcccagc 


cctggccaga 


31921 


ggcctgcagg 


gcccacctgg 


gcatgggaac 


cactgggagt 


aatcagtgta 


attcccgaca 


31981 


aaacactggc 


gtcctcagta 


aggacacagc 


ggcaacattt 


ctgcagagca 


gagagtcgac 


32041 


tgatgtggca 


aaaagagcaa 


ctgaagacag 


aaggcgacct 


ccacatgcct 


gcctcctgca 


32101 


catccccctc 


ctcccccggc 


cccacccact 


ttctctgcca 


gacaggagac 


aggcggtagg 


32161 


gctccccaac 


ggtccccctc 


gtgtcctccc 


caccccccca 


cccacacaca 


ccattcttca 


32221 


ctacgatcac 


caagcggaat 


agatctcacc 


ttccccttcc 


actgtctaaa 


accctcaacg 


32281 


gctctccagt 


gccttcggaa 


gcagggccca 


gcccctcgcc 


tggcattcgg 


agccctctcc 


32341 


acgctggccc 


tggctggctt 


ctccatgcgt 


cacttcagat 


acctccccac 


atggcaccct 


32401 


tgctcctgac 


agcccagcca 


gccttcccca 


gagccggctg 


ccctctcctg 


ctgcctgtgc 


32461 


tttggttcag 


gcctggaatc 


ctcactctcc 


ctgtgcagcc 


cctcaattct 


ttagatatct 


32521 


caggagcatt 


tgaaaaaaaa 


gactcatcct 


aaaaaatata 


tctcaaagga 


aagaggcagt 


32581 


gaSctccttg 


cttaaaccct 


tcctacccca 


aggcatgaga 


ccgcagttcc 


tggcgtgacc 


32641 


tccaggccag 


gagggtctgg 


ccctgctgag 


tcttctcggg 


ctgccctggc 


cctgctccct 


32701 


gtgctccagc 


ggctagcccg 


gctgctcctt 


gcacaggcag 


cctgtctcca 


cctcaggaac 


32761 


tttggacttg 


ccgtgcaact 


gctcccagca 


gtgYctagcc 


catagtaggt 


gctcagtaaa 


32821 


catttgctga 


atgaatggaR 


tgaatcactt 


tgaagatatg 


gaataacagg 


aaatgggcaa 


32881 


aatgtatgac 


ccagcgtagg 


cctgtgaccc 


ctctcatggg 


gaggccccca 


gctaagggcc 


32941 


tcccctcccc 


tccccccagc 


agccctgggg 


gcctcctccg 


catccaccct 


acccaaggcc 


33001 


tgcgtgtgct 


ggggagaagc 


agctctcctt 


tcccaggttg 


cagRgggagg 


ggaagcccag 


33061 


ggacaatgtc 


cttgagtact 


gtctgagaaa 


ttgatcctgg 


aatctttaaa 


ccttgggggg 


33121 


aaatgttcaa 


agccgcagcc 


atctctgtgM 


ggagagaggc 


caagggctct 


cttgtttggg 


33181 


gaaacacatt 


gtaatgcgcc 


ttttgcttat 


gcaaagaYgc 


agccaagttt 


ctgttctgga 


33241 


atagacaatg 


aggaaggagc 


ggctctgcat 


ggcaatgtaa 


agtttgcctc 


actggtggca 


33301 


gagttccacc 


caggggatgg 


tgagccctgg 


gaggtggcct 


gcggctgctg 


ggcagaggtg 


33361 


actttgtcct 


ctgccaaggt 


gggggagggg 


tgcagaaagg ggggaaccgg 


tgcaggctcc 


33421 


ctactgggga 


ggcctactgg 


cctgccagcc 


acagaggaga 


caggcagggg 


ccctggaaca 


33481 


gcagccaccc 


caaccttcag 


gaaagttccc 


atRtccttcg 


ctcagctgag 


ctttgggcca 


33541 


ccagaggcag 


cctgtggccc 


acctggcaaa 




ggagcaagcc 


aggcctgaaa 


33601 


agtctgtcag 


aggcctgggg 


gaggggggcc 


tgctgctctg 


tggcttctct 


gggaccccct 


33661 


gaaataagtg 


aggagaaggc 


tgttctctct 


gtgccccttc 


ggaggccagg 


cttggaccac 


33721 


caggatggag 


gcagagaaga 


gagtgcagcc 


aggagcaggg 


gacagaggtc 


tgctggcctc 


33781 


cgtggaggaa 


gagtttaggc 


attgcatggt 


gcctaaacgg 


ggctgccaga 


gcctgggctt 


33841 


atcatgagaa 


tcaacgttcc 


cattgcagat 


cttcttcacc 




cagcagatgc 


33901 


ttcagaggga 


atgctagaag 


aaacgtgact 


cttgctcaaa 


atatcaaact 


accatgaaNc 


33961 


ggaaatggcc 


aggatcagaa 


ttactaggct 


cagttcatac 


atcaaaaagt 


gaggtggtca 


34021 


gagtcccctc 


cttcaatacc 


cagcgcagga 


acactctgcc 




ccctcactcc 


34081 


tcaggcgacg 


tctcctgtgc 


ctgtccacac 


tctgctgata 


cctctgcaga 


tgcctcaggg 


34141 


tggggacggg 


gctcctccac 


ttccaggcca 


gctcatcaat 


gcacagagcc 


cagcagggtc 


34201 


tcaagagatg 


tgctgcaccc 


ctcagtgtgg 


cactccagca ggcacgttga 


gaagacaggt 


34261 


cgcattacaa 


accttcccgg 


gtttgtacta 


acgacggcaa 


gaggctgccc 


ctccccctct 


34321 


cctgaatttg 


gggggttagc 


agagtgtggg 


ggggatattc 


ccactctgag 


gtcagatttc 


34381 


atgggtgcca 


gcatgggctc 


tgccacaatc 


ctgctgcgtg 


acttggggcc 


agttacttaa 


34441 


cctgtctggg 


cctcagtttc 


tttatctata 


aaatggagat 


aataaYagtc 


cctgtgtctc 


34501 


agggttgttt 


caggttgaag 


caacaatgaa 


aattatcacg 


tgcgtccatg 


tgaagagatc 


34561 


accaaacagg 


ctttgtgtga 


gcaataaagc 


tttttaatca 


cctgggtgca 


ggtgggctga 


34621 


gtccaaaaag 


agagtcagca 


aagggtggtg 


ggactaccat 


tcgttggtat 


aggtttggga 


34681 


tagacggtgg 


agttaggagc 


aatttttttt 


tttttttgag 


acggagtctc 


tctctgttgc 


34741 


ccaggttgga 


gtgcagtggc 


atgatctcag 


ctcactgcca 


gctccacctc 


ccaggtccat 




gccattctct 


tgcctcagcc 


tcccgagtag 


ctggggctac 


aggcgcccgc 


cagtgcgcct 


34861 


ggtttttttt 


tttttttttt 




agtagaggca 


gggtttcaca 


atgttagcca 


34921 


ggatggtctt 


gatctcctga 


cttcatgatc 


tgcccgcctc 


ggcctcccaa 


agtgctggga 


34981 


ttacaggcgt 


gagccaccgc 


gcccggccag 


gagcaatttt 


ttgtgggctg 


gggacggatt 


35041 


ttacaaagta 


cattctcaag 


ggcggaagaa 


tattacaaaa 


tatcttctta 


aggtaggggg 


35101 


ggacaatatt 


acaaagtatc 


ttcttaagga 


tgggggtggg 


gaagaatatt 


actaagtatc 


35161 


ttcttaagtt 


ggggggagag 


aatattacaa 


agtatcttct 


caagggtggg 


gagggtgtat 


35221 


catacaaagt 


agattcacaa 


gggcgggtcc 


ggcgggggcg 


ggtggcaata 


tcacaaagta 


35281 


cattacccca 


agggcggaga 


gggtgtattg 


ttcacaaatt 


caattgattg 


atcagctagg 


35341 


gtggggcagg 


aacagatcac 


aatggtggaa 


tgccatcagt 


taaggcagga 


actatctatt 
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35401 ttcacttctt ttgtggatct tcagttgttt caggccatct ggatgtatat gtgcaggtca 

35461 caggggatat gatggcttag cttgggctca gaggcctgac attcctgtct tcttatatta 

35521 ataagaaaaa caaaatgaaa tagtggtgaa gtgttggggt ggtgaaaatt tttgggggtg 

35581 atatggagag ataatgggtg atgcttctca gggctgcttc gagcaggatt aggggcagca 

35641 tggaaaccta gagtgggaga gattaagctg aaggaagatt ttggggtaac aggtgatatt 

35701 atggggttgt tagaaggagc atttgtcgta tagaatgatt ggtgatggcc tggatgcagt 

3 5761 ttggtatgaa ttgagaaact aaacggaaga cacagcgtcc gaataaaagg agaaaaacag 

35821 gtattaaagg actaagaatt gggaggaccc aggacatcca attagagagt gcccaagggg 

35881 gttcagcata attatttgct tggttggcga gtttttgggc tctatccttg agtttatgtt 

35941 gtcatacacc aggccagact gatttaggta aaaacaacac ctaggtgact caaaagcttt 

36001 attgctcaca caaagcctgg tggtggtctc ttcacacgga catgcgtgac aaaagtgaaa 

36061 tccccagcac ctagtacagg ctgaatgaaa ctgtcatcaa attacaggag ggtccaacag 

36121 cccaagctgt ttccctggga ccccatgcac agaatggtcc acctgggctt cgcccttaca 

36181 cccaccctgg ctctcctcct ggccatcacc atcgggagag gtctgagtga acagcttgca 

36241 catgcccctg tgttttccag agctggaggg gctgctgtgc ccaggcacRt ccgcgagtgt 

36301 gacccagccc tggagctctc cccgccggct gcgtcgcctt Cggggaaaca acagccacgc 

36361 caccacttag ggactgacct tcccacccgc tgaccggatg cctcatcaga aagggaaatg 

36421 ctgcagaacg gggaaggacc agtgctgctg aggctgcagg acgcagaagg acacggccga 

36481 ggcctcaggc cagagcccag tgtcccccag gggcctctcg gccactgcga gccactgaca 

36541 gggaaaggcc gagaaaagtc agggattctR attcttgcta gcagtggctg caggacggcc 

36601 ctgcctccct ggaactctgc tgagcttgtc ggctgtccct ggccacagag ccctgggggg 

36661 ctcagtgagg agtttccttt caaagagggg ccctaagtgg ccagggctcg tgccaccccc 

36721 ggccctggca tccccccaca cgcccgtctc agaggctgtg tggcccggcc catcccgtga 

36781 ctgagctcgg tcacctgggt ccgtttgagc accaccatgt aatttgagaa caatgcattt 

36841 ttattccccc acacatcata ttctgccgct gctctgtcac cagtgtgacc taaggaaaga 

36901 gggaaggaaa atgccgctat gggaggagag agctgcggca tcaggggaca ggaacggtgt 

36961 gggggagggg gcagcaagca gctgccgggg tggggggcgg gctgctgagc ccacaccagg 

37021 cttggagccg gagccctatc cctaccgctg cccccgcctt agggtgagat gcagatagtc 

37081 cccggggttc tgggctccac ttcccactga gatgagaccc tctgctgagg gtggaggcca 

37141 acccacatgg tggccaagca ggtggttaga atggagaggg atctcatgtt aatgaatggg 

37201 cctcaggcag cacatggatg cccctcgctc cagaaggggg tgtgtgggat gcagcagctc 

37261 ctgctggcga gggccagggc gtgggtagct ggcactggct ggcagaggct gtagagtgga 

37321 cacccatcaa ggcatagggc ttggactggg caggggtcca gcatgcctaa gtggcccagg 

373 81 cgtcagaccc gaccacctca gagcaaagct gaggaggact caggccagcc tccctactcc 

37441 ccaggcatcc atgcatgatg aggaagaagg gagaggccct ggcagagcag gtatgaaccc 

37501 ctcagagact gagaaggcat gacagagggc tgcaggccat caccttgcac tgtgcccctc 

37561 gacctgggca ggggccccca aggcactcag agcgcagggc aagagtaagg tgaaggctgc 

37621 aagaagggtc ctgcattaca agctcacggc aagggtcgtc Cgctgccagc gtatccctcc 

37681 ctgaaagcgc agtcatgcag ggtgctttcc aacacccaga gcagatgttc cagcgcctct 

37741 tgggcttggc gggaaacggc tttgcccttc tgtgcagcag gccacaatcc ccctgcacca 

37801 ccagagtctg tttatgtggc ttagctccct tgccagactc tggcagatgg aaagatagat 

37861 gaaatcctgc ccaccgctac acaggaaacc cagaaccaaa ggaagagggg ccgaattatt 

37921 ggaagcacaa gggtccgtga tcacttctgc tcaggatcag aagggccagc atgggcagcc 

37981 cccaggccag gctccctgcc aggcgcgggt gggaggtctt gcaggctgca gccaggtttc 

38041 cttcctcctg tctggctgca gggaagtttc ccttgctctg aagtttcctc actctgtgaa 

38101 cattcccacg gggcacccta tgctgcctcg ggtgctgtcc catcctggta acttcctggc 

38161 accatcgtga tctccagtcg ccttcgtttg ttagctgtct gtccgtctcc actaggcagg 

38221 gactgttctg cattcttgct gatgcaWcct gggtgctcag tactcccgtg tttccccagg 

382 81 cttcaggacc aaggccaagc tcagccgcac gcgccttctc ccaccccagc gggctcccag 

38341 ggtggccgag ggctggcacc tcttttgcac aaggggctcc tctgtgtgga atgatgccct 

384 01 cccactttcc acttgggaga caccaactct gtgaccaaaa ccagctcaag gatcacccca 

38461 ctgtgactat ctccacKacc aggtggagac agctgaccgt ctcctttggc tgccaagRcc 

38521 ctgtcctcRc tgtgccaagg caagcaggtg gcccccatga gaaaaaagct tgtccaggac 

38581 gggagctggc tggggctctt cccagacatc atgggccgag tgaacaactg ctcgcctcgc 

38641 tggtcggctc tgatcttctc cgcctaYgcc gagtacttcc ccgcgtgcga tatctttccc 

38701 ttcccacacc ctgcgagtga gtgttatccc cgctgtgcca gtgaggaagc cgaggcacag 

38761 agagattgag aggctcacat aagtgggcat gggagtcagt acagaggctc tgaggagcaa 

38821 tttggcaaac ctgctgcaac tagaaactgt atcccgggac ctggccattc agacgtcagc 

38881 atggacgatg gagacacagc ctctgtcata gaagcgaagg cataaggaca cccagccacg 

38941 aggagttcca cacccacaga tagagttcat ttttatggga gtctctgaga tcacagtcgc 

39001 tttatgggaa ttcatttagt cacttattca ttcaaaagag atttactgag cacctactac 

39061 gtgccaggcc ttttccaggt actcagtgaa aaatcaggtg ggcagaccct ctctgggtcc 

39121 gatacaaagt cacctttgtg ggagtcccta accctgctga gacacaccac caaggagacc 

39181 cctggactgg ggaggaagac ccaggattgc ttgactgcaa ctcagctcca ggggcagcct 
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3 9241 gggaggggcc tcagggagtg agagcccacg ccaggtcagc ccgacagccc ggcgtggcac 

39301 atcatcgcct ctgttcttgt ggttcagctg tcacctcctt gctggagcct gcacaagaca 

39361 agctccgagg ctgcctcaag ctgtcatccg ggtgtgtgga tggctggccc tcctcgtcca 

39421 cagggactct aactcccatt ctgacaacca tcagttcccc gcgggaggcc aaggacagga 

3 9481 tgctgggagg cttccacatc agctatcagg aggtctgcgg aggagggagg agggaggagg 

39541 tggggagatg tccactggtg gggagggtgc tgcacatttc aggaggaagc agaggtgaga 

39601 tccacccaca agacccctgt tcgagtgttc cccacctgct caccatcact ggtctgtcgg 

39661 cctccccagc cctgctggaa catgtgctcc ctgcggtctg gatcccgtgc tcaccacgac 

39721 atccccagca cctagggctg ctctatccgg gtgcactaag ggagtgagtg ggcaggcggg 

39781 Ygggcctcag aacaggagga cacagtagat ctatgcagaa gccaaggcaa agcaagaggc 

39841 agagcggatg gcctgYgggg agaggcgggt gtggccaggg gactccaaat gctgtgggga 

39901 ggtcagtgag gcctgagatg agaaccYgcg gagcatgcta ggggatttgg gagctctcga 

39961 tcttcaggcc agacagtagg ggaggaccag cacaggggcc accttcccac aggtatgtga 

40021 ctccatctgc cgtgctctgg ccacgtctct ctcagtctca gtgacctgat gcaggcaagg 

40081 gaacctcaca gggccactgg ggcaaggtag tgaggtcaca ggtgtgcctc cttgcacagg 

40141 gccagcacgt gcaaaaccag caaacgcgag ttcccctcct caggtgggga ggggtgtttc 

40201 caaggataga aagcataagc cccaaacctc ctaagagaga ggagtctatc tttgtagacc 

40261 acaaatcaac ctcgctcccg ctctccaagc cccacaagct ccagcagggc ccaatctgcg 

40321 cactgcgatt cacgatgcgg ccaggccttt gcgcgcgctg tgccaccacc aggagtcctc 

40381 attagaggca actgcggtgg cccttgtccg aggagccttc cctgaccagc cgccccaggc 

40441 acccttccct gatgaaccaa ggtgtaatca cgtgtcctga tcccgctccc catactcctg 

40501 tgctcYgagc tcccccagtg cagggatgcg gaggccagga gtccagggag tgccgaacat 

40561 gactgaggac agctcccaat cccagcatgc tcctcccaca cccgtgctct actgagcact 

40621 ttcctggtgt tcaactcgag tggtgctccc aacagccccg ggagttggca ctgaccagca 

40681 cagggcgcag caggacaggc agtcgcctcc ttccttctag gctggaaaga acactcaggc 

40741 cacccacaca ccagacttgt cctgcagccg tgttctgctt cacctgtgca gtgtggcttc 

40801 ttaatttcaa tctgtggccc acattgaaaa ataattagat ttcccaaaaa aagtagattt 

40861 ccagcttctc atggaagagc gagctggcag cagtggtctg cttattccta cagaaacaca 

40921 agcgagtgga accacgagga gctctgcagg ggggcgtggc ctctccggtc gcccgagtcc 

40981 ccaccactcc caacagcctt cagccacctg ggcccctcgg gcctggctac tgtgagcacg 

41041 tgagggcact gccctcttct gagccttgcg tgcagcctcg cacacaccac tggccctcat 

41101 ggctgtgtct cctggtggca cacagggcca ggctgtaagg ccagcgggac aagcgtgagg 

41161 aacagaaaca gtgccagcaa cacccccagg gccacagcac cactagcaga ccctgaaccc 

41221 cgcttgcctc cccccactta agggacccgt gtttcgtgtc cagtttcaga gatgttcttc 

41281 tctttcccag gaaccagagc cagcgactcg tgaactatga cttggggcga cgccttccgc 

41341 ctggctggtt tgcatctctg cggccagttt acgagatttc caccacgacc agctgaaaac 

41401 actcctgtcc gaccactgcc ttccccaccg ccaccatgcg gtctcagcac tcgtgaaagt 

41461 tggtgagaaa ttgagcccat atgccttcgg attgtttgca aaccctggca aggctctggt 

41521 aagtccaggc caatttctgg attaaattag aaatgaaaag aggaaaaata tctgaatcct 

41581 ggcatagaga gaaatcattc attcaaaaaa aacggtcgta gaacatactg tacagagaat 

41641 gttctcgtgc tgggaggggt caaaggttat agccgcgact gaaggcgcag aagatccttg 

41701 gattccacgt tctccatagg cctaggagcg tgtctgccct cagcagaaga gatgctttgg 

41761 gggctcaggc attctcccag aacataagca gttaacacag ctgggagaga ctattctgca 

41821 tgcccggaaa cctgctcctc ctgctctgcc gttctacagc ggttaagaSc agttcattct 

41881 atcctttaaa aagaatcccc gaagtagcct caccggtcct ttgagctgat aaggctagtg 

41941 tgggggtggg ctggggtctg agcagcccac tgtgctcagt acttccctaa tacgtacacc 

42001 cacgtgcact gatgtagtga aaggcacatg ctcactttgg aggcctgggg ccatcacagc 

42061 tcaaggcagg cactaagcaa gtacatttct gcacagcctg gtagcaaaaa ttcctctagc 

42121 acctcatagt gaatgttctc agctttgtgg gccatatggt ttctgttgca actacccaac 

42181 ccactgttgt agcacaaaaa tggccacagg caattcataa atcaataggc gtggctgtgt 

42241 tccaataaaa ctttatttac aaaaacaggt gaagggccag gtggccctgg ggccagagtt 

42301 tgccaacaca gctttagaat ctccagttta ccttgacaaa ttcacgtgaa ttttgctttc 

42361 atgcaagtaa tacgtccaaa tttactttaa tgtaaaactg atgttttaaa atgcttacca 

42421 tccaaaaaag gggccacttc agggggagat gYcttacatc tgtctggtgg cctctggcag 

42481 gcctgagagt acgtgttcct ggtagcaata ggttgagcgc tggccttgct gcaggctggg 

42 541 tatcacccac tgcaatggtc tgcctcctgt ctacaaggag tgcggtggga ccaggggctc 

42601 acaaagacag ttcaaggtct ttgtagatcc tgatatttaa gcaacacatt gaggccttag 

42661 gcttctcggg aagagtagga ttgaaattag cgctttagtg aaatgcttga agccacaaag 

42721 gaagcttcag ccttgggccc aagctttata tttcttgaag tcttggcaca gatagatttt 

42781 acttttaaaa atgtatctgt agccaggcgt ggtggctcgt gcctgtgatc ccagctactc 

42841 aggaagctga gacagcagga tcacttaagc ccaggtgatt gagtccacca tgcagcatag 

42901 catgaacctg tctctaaaac aaatctttaa aaaatgagtc aggcatggtg gtgtgtgcct 

42961 gtagtcccag ctgcttggga ggctgaagtt tgaggctgca gtgagctatg attgcagcac 

43021 tacgctccag cctgggtgac tgagtgagac cctgtctcaa aaaagaaaaa aatgcatata 
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43081 


taaacatatt 


acttccacaa 


cggctctgat 


ct tccaaggc 


attcatccct 


gcctcccatt 


43141 


tacctgagcc 


tgcagaagag 


tccctgtctg 


tacctgggga 


cgaaacttga 


ggtgccacac 


43201 


acatctgagc 


tgcctcctag 


catggcagcc 


tcgcctgtca 


ggcccaagcc 


cactctttag 


43261 


gggtgcccag 


gcccacacac 


atggccacag 


catctggttt 


gtgagcagag 


actagcccag 


43321 


gcgctggcag 


agcaagtttt 


tgaaaatgtg 


agaaatacca 


aagccccagg 


gtcagtatca 


43381 


ggcatgttca 


gcaatgtcac 


ctgtgtccca 


gcctggcagc 


acccagtact 


tcagggagta 


43441 


tgtttcccaa 


gcaggaaagg 


tcaaacatga 


cagtggtccc 


ccaatgccct 


cgggggcggg 


43501 


gggcgggcct 


gggcagagga 


gagcccctta 


tcctcccggc 


tgggcccagg 


ggaggcagca 


43561 


acgcaaggtg 


gaggctgcag 


gctgagccag 


agtagaaagc 


tgggaggagc 


aggagcagaa 


43621 


tgaaggcaga 


gggaatagaa 


gtcactctgc 


cagtgacggg 


gctggggggc 


cagcaaagct 


43681 


ggatgggcca 


cacatctgcc 


cacggccctc 




cctagcctca 


cagtgcttgg 


43741 


aagggagacc 


cctctgccct 


tagacggggg 


aatgtgggca 


tccccaaggg 


gcctcagcat 


43801 


ctctgaccag 


gctaagggcc 


gctgagcctc 


aggagaccag 




gggttgccag 


43861 


cagccaggag 


ggctacagct 


gcgtgtgctc 


aaactccagt 


gtaggccggg 


agcagtggct 


43921 


catgcctgtg 


atcccagcac 


tttgggaggc 


tgaggcgggt 


ggatcacctg 


aggtcaagag 


43981 


ttctagacca 


gcctggccaa 


catggtgaaa 


ccccgtctct 


actaaaaata 


taaaaattag 


44041 


ccaggcatgg 


cggcacgtgc 


cagtagtccc 


aggtacttgg gaggctgagg 


caggagaatt 


44101 


gcttgaacct 


gagaggtggc 


ggttgcagtg 


agccgagatc 


atgccattgc 


actccagccc 


44161 


gggcgacaga 


gcaagactct 


atctcaagga 




aaaacaaaac 


tcactgcaga 


44221 


gagcctctgt 


ggacaagggc 


ccgccccctg 


cctgccacag gctcagcaca 


cacccagtgc 


44281 


cctcttagca 


cggctggctt 


cgcctgggcc 


tcagaggtgt 


gtggacatgc 


ccaagcagag 


44341 


gctggggaaa 


ggccacagag 


aacgggagtc 


tgaggtggca 


tccagggggc 


acatgcaggg 


44401 


cagcaggtgt 


tccctcaggc 


tatggggtct 


gagcgcaggg 


ctcaggtttc 


tgggctcgcc 


44461 


ctagggacag 


gctgacttca 


tagatgacca 


gggcctatcc 


tcagaaggcc 


ccgaccttgg 


44521 


ggcttgaccc 


tctgcagctc 


tgtcttgaag 


ttcttcgtca 


tctcaccctt 


gcatgtgtgt 


44581 


tttgtaagtg 


aaggctcctg 


gggcaatgca 


gcatggtccc 


atctccccag 


gacaggcttt 


44641 


cagctgcccc 


cagcaggggc 


aggatcaggc 


atgtattccc 


cacggcatct 


cagggcaggc 


44701 


gtggtggtgg 


ccatcccgcc 


cacggctggc 


aggaaagggg 


catgtctggt 


ggatggctgg 


44761 


ccagagcccg 


tgtttcggca 


gaggtgaRcc 


tcccgtgagt 


gacacctgca 


tagagatggg 


44821 


attggccgtg 


gccatacgct 


ctgggctgga 


gcagtgcacc 


agtaggaagg 


gggcgtgcct 


44881 


ggctctactt 


ccccagacgc 


cctggccagg 


gcacagccaa 


tgtgttcttc 


caggggtggc 


44941 


aggaagctct 


cagaccccag 


tgggtgtgcc 


cttcctgctg 


ggggtgcact 


cactctgcta 


45001 


ggaccccgta 


cctgaggagc 


ctcccctcgg 


caggattcca 


ctctgtgaaa 


tcctctcttc 


45061 


ttcctcccag 


cctccctgag 


tctggctgtg 


cctgtcataa 


acttctggtc 


agcctgaggc 


45121 


acccatcagg 


gatcagccac 


aaaatacaca 


tttgtttaat 


ttcagtgacc 


ccacatagaa 


45181 


gctaaatgct 


ctaatatttg 


catttaaaat 


agccatcgca 




atgaataaga 


45241 


aattcatgct 


aataattcaa 


ggttccaatt 


tttgttcact 


tagaacaata 


ttaaaaagca 


45301 


aataaaaaga 


acagcgagtt 


gagagacaga 


gagacagaga 


ctaaggaaga 


aaggaagtgg 


45361 


cttaatagtc 


cctttgtatg 


gcaccttcct 


ctgctctttg 


aacaagcgat 


cccacatttt 


45421 


cattgtgcgt 


tggggcctgg 


gagttactac 


tggctctgcc 


tttaggccag 


tgttccccac 


45481 


tggtaatatc 


ctgacgagtt 


ttgcagcagc 


cagatattga 


ggggaaataa 


gaatgcattg 


45541 


catgtaatac 


attggctcct 


ttgaattctc 


tatctccccg 


ccacaccttt 


ctttcctttt 


45601 


tttttttttt 


ttttttttga 


gagagaatct 


cactctgtcg 


cccagactgg 


agtgcagtgg 


45661 


cgtcacctca 


gctcaccgca 


acctccgcct 


cccaggttca 


agccattctc 


ctgcctcagc 


45721 


ctcccgagta 


gctgggatta 


caggcacgtg 


ccaccacact 


cggcttattt 


ttgtattttt 


45781 


agtagagatg 


aggttttgtc 


atgttggcca 


ggctcgtctc 


gaactcttga 


cctcaagtga 


45841 


tccgcccgcc 


ttggcctccc 


aaagtcctag 


gattacaggt 


gtgagccact 


gcactaagcc 


45901 


tatctttcct 


cttactttgg 


gagaaattca 


ggtggagggc 


tgctatggct 


aaacccaaga 


45961 


acagtcagcg 


gttggtgagg 


aactggagct 


tgggaagtac 


agctgccaag 


tagggcagaa 


46021 


actcagaaga 


ttaacttctg 


gggagggcag 


gaggcatcMa 


acacatcacc 


caagggagaa 


46081 


gcttttcccc 


tgggaaggga 


aagactgcag 


gatctttccc 


tgtggtcaca 


cacacaaaaa 


46141 


tgtaggacat 


caaaaactat 


cccaggcgga 


ataaaacaaa 


aggtcaacaa 


caggactggg 


46201 


ggaaatattt 


cccacacatg 


tgacagacaa 


aaagtttatc 


tacttaagta 


cacagactct 


46261 


tcagatgaat 


aaaacacccc 


caaatcccag 


cagaatcaag 


ggcatgaata 


ggccaagcaa 










acgtatattt 


ttaaatgcag 




46381 


atcaaggaaa 


taacaactta 


caacaaaaaa 


acttatctcc 


gttgtaatta 


aattaccact 


46441 


gctcaaaagg 


actcagtggt 


gtgaggaggt 


ggggagacag 


tcatttgctc 


aatgaagcgg 


46501 


gaatagatga 


aaccataaca 


gagaacattc 


cagtgtagaa 




atgggcatat 


46561 


ctgttggcca 


gagacttcgc 


tttcagcaac 


tcagcttaag 


gaaataagta 


agcacgagcg 


46621 




ctctatgaac 


gtgatacagc 


tagtggcatg 


tctgacctct 


aatggagaaa 


46681 


taagttttgc 


tatgtgaata 


tgtcatataa 


ataataagtt 


aggtaaaaag 


tgtgtgacag 


46741 


tcatatggtg 


cgatgttatg 


caatcattaa 


aacaacgtag 


tagacaatta 


actgctggga 


46801 


gaagacattc 


acatacctac 


aggaatttat 


gctaatttta 


catagYaaaa 


aaaatgtccc 


46861 


ggcacggtgg 


ctcacgcctg 


taatcccagc 


actttggaag 


gccaaggtgg 


gtggtcacct 
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46981 


tacaaaaaaa 


aaaaaaaaaa 


aaaattaqac 


ggecgugg tg 


qtqeatqect 


qtaatcccaq 


47041 


ctactcaqqa 


t 


qaaqaatcqt 
99 9 9 


ttgaactegg 


qaqqaaqa 
a act t ? t 


tt t 
gcag gag 


47101 


ccaa atcac 


accact ta 


ccagcc gg 


aatat^act 


attt C C t 9 


c cacaa aa 


47161 


ataaataaat 


aaaataaat 




99 


caa aa 


accaaaaaac 


47221 


at ctcttct 




t 

99 9 


gaaaac gga 


cccagcaggg 


gc gc ggag 


47281 


tgggcaggtg 


t tt 




9 


9 9 99 


99 a 9 


47341 




tttccaaacc 


cttcccttc 




gaga gaaaa 


gggcagcttc 


47401 


atctttccca 


99333999 


aatatcac t 
9 


ctt tcttta 




gaagcagccc 


47461 


etc aotaat 




cccggagcca 


caggcggctc 


ccctccacct 


c ggaagac 


47521 


gcggcggccg 


9 ^t fc 9CC " 


cc° C ccccat 


tcc^cctca 


gccccc gc 


gaeaggeca 


47581 




aacct °aac 


ccggcccca 


tat 9 tac°t 


gggccttggc 


gccgcc gec 


47641 


ccctccccca 






C 


c cccaca 


ccttccgttg 


47701 


tc t a cct 


W t 


3 atatctcca 
tt ^ tt t° a 


9 g^ ac ^ a g c 


t^ttt^ttt 


tagaaagtgg 


47761 


cct ccttcc 
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50761 tgtaatcaca tcaaccccca cgatgaaaac actggcccat tttacagatg agaaaacYaa 

50821 ggctcagagg acRggagcag cgttcccaac gacgcgcctg gaaaaggcga ggcctggagt 

50881 YRccatcgga ctgtaattcc aagcccagtc ctttccccgc tgggttggaa ggcgacataa 

50941 tgctagggga agaactcata ttgtgaaggc agccaagatt ttatacctgc cagtctcaca 

51001 gacctagcat gaggacttcc caagataatt tagcatggca agtgcccagc atgggcccgg 

51061 cacagtcaca tagtaggggc tcgataatgt agccattatg gttttattgt tattattatt 

51121 attattatta ttattatttt tgagacggag tctcgctctg tcactgaggc tagagtgcag 

51181 tggtgcgatc tcagctcact gcaagctccg cctcccaggt tcacgccatt ctcctgcctc 

51241 agcctcctga gtagctggga ctacaggcgc ccgccaccac gcccggctaa ttttttgtat 

51301 ttttagtaga gacggggttt caccgtggtc tcgatctcct gacctcgtga tccgcccacc 

51361 tcggcctccc aaagtgctgg gattaYaggc gtgagccacc gcacccggcc gttattatga 

51421 ttattgcatt aaaggtacgt ttcaaaaata agatgagggt aagtgtgaca acgctctccc 

51481 aaataaaacg aattctgccg acgcagcgga cttcggcccc ctgtacggat aagRctgggc 

51541 tccagagggc tagctgactc acccaagacc ggtcaagagg aaagtgagca tgaaggtggt 

51601 gtctctgtcc agtgttcagc tcactcacat cccctcctgc ccctgaggta tcttttcacc 

51661 agggccctgg ggactctgac ccatgtgtca gcttggctcc tggaggggct cacccctggt 

51721 ccgaaaagca gagctgggca gcagatcctg agcagacctg cacaccctgc tgcctgcctc 

51781 tagcccggcc ccagaggacc tcattcatcc acaggtgtgg acacgcagca cggggttcta 

51841 gaacttccca cctgcccggc ttccacacca cctccagccc tccctaagcc ccggcttcct 

51901 cccacaaata cccccctgcc cctagggccc cactggcagc atctcacatt ccaggccgtg 

51961 ccctggtcct ccttctcacc cccttctttt ccctgcagga ctcagccaag ccagtctgct 

52021 cccctgggag cctgcatttt gaagactgga cgcccctcat ccccacagtg gctctgacca 

52081 catgaggcag agagaggaaa ggtctgcaca gcctccaccc acacacggtt cagcatcctg 

52141 gcacctcctg cttaggttgt tgggggtatg gaatgagggg gaggtggttc tttaatctct 

52201 tagtgttagg cagaccatgg gcaggtgccc agaagacgaa ggtggcacat agacacccct 

52261 ttcaatagcc cagactcaca gggagaaagg aattccctcc ccaattcctc ccactgYgag 

52321 ctttaaggag gggccctggg ttggtgtctg gccctcgctc taatgcaYgt gaatctctac 

52381 tttaaacaga aaaggggctt ctgtctggag cgagaagaag gcagggctat ccagctgggg 

52441 ctgaataacg tcactttatg tcttactatt atattagctt ttcaggtaaa ataaggttgc 

52501 ctttttaaat gattttagtt ttaaaagtgg caaacatcaa gaaaacatat ttttcaaatg 

52561 acgggacatt gaatgacatt gtcattcaaa cgactggagt ttgcacaaac ccgccttagc 

52621 ctgtgaagag cctcaccaag ttaccatcac ccacgatgtg cagacgagag aactgaggtc 

52681 agggatggtc aacttgtctc acctggggac ccagcaagcc tccctgagac tgtgtcactc 

52741 acctccggct aacaaggaaa tcggcagagc ctcaggggac ctggcccagc tgggcctctc 

52801 tggcctgccc agcccctggt tgaccctccc cgccccaccc tgccccctca gccaccccac 

52861 tcacaatgtt catgagtgtc agcaggtact tctgtacgtg ctccttcccg tggaactgca 

52921 ccacagagtc atccacgccc agcaggacct cgatgttgta gtcatcgtcY gcagcatgcc 

52981 tgcgtgccct ccgcctcgag ctgttggcgt gctcctctag gaYgcccagg gcgcggctga 

53041 ggctgtccag gctgtccagg gaggcccctg caaggagagg acaccgtctt cagcggcagg 

53101 gcaaacccac ccggacacaa acctaccagg cgccagcttg gccatcctct catttaaaac 

53161 tcatagcacc ctgaaccgag ggtattattc cattttacag aggaaagcga ggtgcagagg 

53221 aagatggcag agacttgcca agctgtctca gacacagcag ggcccagaca ggcacctctg 

53281 tgacttccag gaccgctcgt tccccagccc ctggtggcaa ccccagccac agcaggggcc 

53341 acagtggcct ccacaYgatg gatcagacct gctgagactg tgagagaccc agcccagcat 

534 01 caccgtctgc cttatggagc ccagggctgg cattcagcca tggagaggtc gggaacctca 

53461 gcccgcactg cctgacgctg gagtgggcag agccacccct tgcccacact gcccgatgct 

53521 ggagtgggta gagccacctc ttctcccccg aagggctgcc aggtgccaaa ggactggtgt 

53581 cccaggtgtc atcgccagct gttatgactg gggaaaccga ggcccatcct gggctgtgca 

53641 tgttccccag tgtctccttg ggcccctccc agaactttct gctaagtcct ccactgaagc 

53701 gaggccccac atctctgggc caggtcccgg gccccctccc caggtgaacg gtaccgcctc 

53761 tggcctgatt gaattgtgca gagaccaaga tgaacccctc tgtggcagct ccgctgcagt 

53821 cggccctggg caggcactga cctcctgccc ccagggtgtg cccctggcca cttggggctg 

53881 cMatcctggg gagtctgccc cggatatcgt ccctgccttc tgtgcagtgc tgtctctgct 

53941 gtgttctgtg tcctgcgggc ctggagacca ccctggacca ggcttcagtt cccagccctc 

54001 cactgcccgt ctgaaggcca cagcaaggct ggccaggctg acctgcagca tcgatggcag 

54061 ccccagacca cactgcccac cgccccccaa gccccagtgt cctggaagcc ccctttccct 

54121 tgcactggtc tctgtgtcta agccaggtcc cagccccagg gtctccatgg cattccccag 

54181 gcaggtcctc agagactcct ggcccgggac agacccaggg ctgcaggtca tcgtgacatc 

54241 actgacccac agggcggtgg ccaggccccg aggcccctgc agctgaggga gaccagcgtc 

543 01 cctgctgcac cacctctcta caaagagtct tccagaacaa cacttttcac ccccaacgcc 
54361 tctctcctgc tgcctcttcc aggcagggac aggcactccc tccctgtgag ggtggcccag 
54421 gacaggccct ggaggcagaa caaggcccct gcaggcggca cggccactcg gacagtgagg 

544 81 cagcctggta ccaggccctc atcccggcac acgcttcctc acccagtgct gacagcaggc 
54541 cactcggtgg acRtggccat ttccgtttta cagatgggga gacccatggc atcccagcat 
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54601 ggccgactct cggccagaag ctctctcctt gcaccctgca gcccggctca gacaagacag 

54661 actgaacatg aagtttcagg cacaaacctg cagcccggct gtgggcaaga ccaagacaga 

54721 cagtggaggt gcgaggcgga gggtgagggc cgggaaagag aagaggagta agtggggcca 

54781 gggctgagca gctgctgggg ttccagtacc ttcgggggtc cttgtgccac ccccagctcc 

54841 acagttctgt gtcccctcgg cgcacacaca catgcacaca cacacacatg cacacacatg 

54901 tacacacaca caagcacaca catgcacaca catgtacaca cacaggcaca cacacatgca 

54961 tgtacacagc aaggactttc cagggccact ccagggacag gggtggggcc ggcaggcctg 

55021 cgggcagggc tgtggggagc cgagcagtcc acttagctgc ctggatgact cagggccatc 

55081 catgccgttc ccagaggaag cgccagcaac tcaagcctga caggggggtt gaggaatcgc 

55141 cactctattt ttgtacattt atttctttgc atttccaacg aacagagaca gctcaccttc 

55201 agaatcacct tgggaaggag agtgatggca atttccatag gaaaactctt cctccctggg 

55261 gaacaggccg ggcgttaccc tctgcccttg tcaaactagg ggtcacacac gggcggtgcc 

55321 ctcggcaaaa ccagaccaca gaggcactcg ttgaggtcac gtggtgccat ctggttgata 

55381 gtatttgaaa actgggagat ttcatgtttt aaaaatctag aacatccagc ttctcttgag 

55441 gaattaagag atggggccac cccaggggta gcccKggtct gtgaattgag gggccacacc 

55501 tggagacaaa acacacaggg cttggctcaY caggccacct gcctgccact gtcaatgtca 

55561 ttctggcacc agaaggtatc acagtatgtc ctcctggccc taaccccaga gtccccatgc 

55621 agggacccct ccagacattc cccgtaggag ggcatgggtg ccgcatcggg acagagctgg 

55681 gctggggcac Rtcttgaggc tgcttctgcc gagcaggcag agtttttaac taaactgtgt 

55741 gttgcaagcc cattaaaaac agcaaaggtt tctaaagatt tagaatctga atttcacctg 

55801 aaatggaatt attattgttg ccatcattat tttaatcgtg ataatttKgg ggtgaaattt 

55861 ggaaattatg ttctagagSa tccctccacc caggaccagt gcctagagag acggcccaag 

55921 gcccaaaggc aaagtgggtg aggagaggaa gacaggggcc tgggccacct ggcaggtgac 

55981 agccacgccc tgcgtctggc ccctggaagc ctcagtggtg ctcaggctgg acaggacaag 

56041 gggagaagcc ctaagagcag ggggatctgc aatgccctca gacctggggc aacaggagct 

56101 ctggcccagc ccccctcctg tcctcctctg tcccaacaag acccggcagt caggctgaag 

56161 ccccagcttg tcaatcacga gtgccacctc agttttcccc acaataagat tgaccctgac 

56221 cctgtcaatc acgagtgcca cctcagtttt tcccacattg ggcttaaccc aggccatgaa 

56281 gagactctga ccacatgctg gcccttgcag acctgYtccc caaatagagc cttgctatag 

56341 ggagctctgg ccagtgggcc tggcgtggtc ctgagtcgtc ttcctagtga tccctttgaa 

56401 gtcaaccttg gccttgggcc tgcccacctg cactggttgc ctacctggtc taacagcccc 

56461 tgttccactt ggtccctcca aggcagaaga gggctatttg ctgctctccc ctcctcccct 

56521 gggagctccc tgtgtagccc agccctgctg agctgtggtg actgggagct cacacagcct 

56581 ctgctgtatc cctcggcact caccccgcct ggatgagtga tcagcgcacc catctcctgc 

56641 agaggcagga ggatgacact gcccttggcc ttcctttcct ctgtgtgcag ccttccccac 

56701 aagaaggcac tgggccccct gggggtgctg attgaccaag ccccagaagt catacccagg 

56761 tgggctgcag ctaccctgaa gggaggtgtc tgtggcagga tcccatcttt taaataggta 

56821 acacagagct ttcgagttcc tggcattccc tgtgctcagg gccccttttg gatggcagct 

56881 ggagatctct ttgctcacat cgtggccaaa Kaatgacact gtgggaagca ctccagagcc 

56941 atcctgtttt agagtctcta gaatggaaca aaaagactct atcgaagagt caatcaataa 

57001 acagggagag gatggaatga ggggaggcag cagttctgtc cagggaattc catcagggga 

57061 acgggtgggt cacagtagcc acgggctcat ccacagtgcc acagtgcaca gggcacgtgg 

57121 gcaagggctg caactgccag catcctaggc acaccccgag tctccaggca ggggtgtgtg 

57181 cacagccctt cacagtggca gctccatacc cccagcagcc atccaatccc ctcgccaagg 

57241 tggaggcgtg tcaccatcag catatcgtcc tatttcagca tcagggaaac aaaccaagga 

57301 gcaggaaaat tacatgactY ggacaagcca caccaccaga gaggaggagt gggcacgcgc 

57361 ccaggcctct gagtccaacg ttgtccacat Rcccggagcc cttctgtact tcaccagcac 

57421 tgaggggtga tttgcgRgaa tgagaattgt agaaacaagc atgattagac tagagaaSag 

57481 aaaacctggg actgagtctR caacctcggg agcatcgtgg gatggtgcgg gctctgaggg 

57541 tgggtacagt gggcaggtgc gggatctgtg ggcaggtgca gtgggcaggc acgggctctg 

57601 tgggcaggtg tgggctctga aggcaggtgc agtgagcagg cgtgggccct gaaggcaggt 

57661 gcagtgagca ggtgtgggcc ctgagggtgg gttcagtggg cacgtgtggg ccctgagggc 

57721 gggtgcagtg ggcaggcatg ggctctgtgg gcaggtgtgg gctctgaagg caggtgcagt 

57781 gggcaggcgt gggccctgag ggtgggttca gtgggcacgt gtgggccctg agggcgggtg 

57841 cagtgggcag gcacgggctt tgtgggcagg tgtgggctcc gaaggcaggt gcagtgagca 

57901 ggcgtgggtc ctgagggcag gtgcagtggg caggcgtggg ccctgagggt gggttcagtg 

57961 ggcaggtgtg ggccctgagg gcgggtgcag tgggcaggtg tgggccctga gggcgggtgc 

58021 agtgggcagg tgtgggccct gagggcgggt gcagtgggca ggtgtgggcc ctgagggcgg 

58081 gtgcagtggg caggtgtggg cctgagggcg ggtgcagtgg gcaggtgtgg gctgagggca 

58141 agtgcagtgg gcaggtgtgg gctctgaggt caggcagggg ctttgcagtg ctaagggtgg 

58201 ctgggtgttc gctgtcccct gtgcgaagct cagtcctatg ccttttctgg gctggcctct 

58261 tcacctgcga tgcccttttt gctgctgccg ctgtgctggc caaatttagg ccccctgaaa 

58321 gacccagaag gaagggagcc cgctgcctcc catgggactc agagctcttg aaagacagga 

58381 gcatgttctg cccccatctg cctcctgcac agccacactg cgtgccacta ggggacgaca 
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62281 cggctcatat cctaaagata caggatgtct atcggatagc ccagtgccct ccttggactg 

62341 acccttggcc catgggccat tgttcggccc catggactgg cttgcagatg ggcaaacttg 

62401 cacaggtctg taaggaggca atggggagct gccccttgca gcattgcttg ggtggagtga 

62461 ggggagctgg aaacaacacg ggggtcatca gtgggtgaga ggagaagcaa atgtggtggg 

62521 ggcatcccac aaagtgctac caaggttaga agtttgatat ctacatagat ggatggatct 

62581 taaaaacaga gtgccaagtg caaaaggtaa gaaacagaat gagacacaat tcagagcgac 

62641 ttacRtaaat tatggataca cagacgcaag acagcaacac tcatgttgca aaaatgtaca 

62701 ctaaggaaag gatacggttt gatcacctga aatggttgct cactgcagga gaaaatcaga 

62761 aggaagagag caggctgggg agataaatgg cttttgtctc tttctttgtt ttaatgatgt 

62821 taggacaact ggttagaggg aggaaaaggt caaattttga cctcatgcct taagcaaaaa 

62881 cagattccaa gtggatgaga tgatcaaaag gaagtcatgg aagtgctaga agaaaatgta 

62941 gcKaggcacc tttttatcat cttgctgttg ggcgggcttt gggttccaag cctagggctc 

63001 tttcccagcc ctctccttag ctgccccatc ccctgtgagc agtttcccac tctgctggac 

63061 tctgctggct gaatgcaagg gcagatggca cacttgctag gggatggtgt gaggggggat 

63121 gggcacactc Wctaggggat ggcgMaaggg ggggatgggc acactcacta ggggatggcc 

63181 tgagggcaga caggcacact cactaggtag gggatggcct gagggcagac ggcacactca 

63241 ctaggtaggg gatggcctga gggcagacgg cacactcact gaggcacctg cacacagaca 

63301 gaggtgctgt gctactgcct caacctgcca accatgtgga tgggaacaag tcatcctttt 

63361 tccagtgctt aggagtcccc aaaacgtaag tcctaatgac cgcccYgcag agccagaact 

634 21 gccacaStgc ccagaagagg agagggaggc tcagggaatt agactaaggc catgatgttg 

634 81 gcaaatggcc aggccagaat gccagcccag ttcctgcaca caagcactgc cagggcctgc 

63541 aggggccacc accRggctca ggaggaccaa agcctggagc accagcagcc tctcctccaa 

63601 catgaggagc cccacccagg ggaggtcaac gggaacaagg tcatcccctc catttcttac 

63661 Yaaccaagag agagtcgctc tgtggctgtg tattccccta aagataatca ttaagatgaa 

63721 atatgtttat gtggaagtgt aatcaggaca cagcagaaag ccctgcttgg aagcaggcag 

63781 gctctttgga ggctctggga agcagaaggc aggctcagcc ccacgggggc catcagcaca 

63841 gagccccact ggtgtacttt attaaaagat gctcattagg attattacat cccagccctt 

63901 tataacttta ttactgactt tgaattaatt Cttgcaactt caacagacac agctcatcca 

63 961 atgccatttt tatgacctga ggtggatcgg agggtgggga ggggctcctg gggccggcct 

64021 ggccatcagg agcgcatcag gaaccgctgt gccgaatctc aggggcagtc ctgacagttc 

64081 tctaaactcc caYgggatgg gctgccccgc ggacagatgg acgctccagc ccaggcctgc 

64141 tcagcctcag ccRtgcttca gggagaaaaa ggaagcacta aaggagagaa ggcctctccg 

64201 ggccagtcct cagaacatac ccccgcacag cagaggacaa aggacacccc tcagtggtgg 

64261 acacagaagg tcctggcctg cccagcacct gcacaccctc acagagccac ctggggttgc 

64321 acagcacggt ggcctgccct gtctggccat tacacctttt ctgctagtaa accaccctcc 

64381 cctatttgca ccccatatgg ctcatgggSa gccaatccca ctgcacccca gctccaggag 

64441 tgcccaaggg gtcctgttag gccactggaa cctgatgctg tgtgttggtg tgcacaggaa 

64501 gactcccgtg ctttcctctc tgctgggaaa atgtgactct gggRctgcta agtacctcct 

64561 tcccccagga gcagcagagg cgagagatga gggcagcccc gtgtgctgct gacactgttc 

64621 aggtccccaa atccagctat ggctgaaggt gatgcacccc tggacatcct aattcttcag 

64681 gtcaattcat ttagctctct gtctcttgca accacgagag tcctgtgtgc ccttacaaac 

64741 acagacacag ccaaatgtgc tggcctgaga gagacaccat ggtttggctt tgcagccaca 

64801 gctctcttga acgattgagg catcagacca ggccactctc ttctgccctg tgcctgcacc 

64861 tcccagtgtg gggcgaggag aaggtctgca gtatttaagg tctatgacca gaaaaggggc 

64921 caggctctgg tccggctccc agatacgtgt ccttgggaaa tggtgcccag cagaggtgtg 

64981 aggctgctga ggttgagccc aagccagcag ggccatgctg agtgaccagc agctgtgcga 

65041 gcccctctga ggtgtcctgc agcagggctg gggcttcctg tctacagccc tcacagaggt 

65101 tcccaaacac aacttccccc tcttcacccc agggctgggg cctcctgtct acagccccca 

65161 cagaggttcc caaacacaac ttccccttcg tccccccagc acccgccgaa ctaccgaggc 

65221 tcctctgcac ctgccccgtg ccactctcca gcggccYctg cagcccactc acctccaagg 

65281 attcacgggg gtctcggggc cttcagaggc tggggcgggt gacaagagcc agcaagccct 

65341 gcccctgcct agcatctgct taagccaggg agggtcccca ggactcacca cagcccccag 

65401 ggctccaagc cttccctctg ctgggcagtt ggcccaggga aggggagctg gcctgggtaa 

65461 tgtcacgggg gcaccgcagc acgcggcagc caagtggtgg gggccgatgg agaggccggc 

65521 aggtctttta ttgaaaatat tcactgtgaa ctccagaaca aaacgaagat ccccacgggt 

65581 cccaaaggaa gggagaattc ccaggggttg tgttctgaac gcagaRgagg tgggggccgg 

65641 caagtggggt gctgacagcc agtgccacgg gacccatgtg gccggaggct tggccagtgg 

65701 ctcagcagat gacaggcgga aaatccctgt caccagtcYg gtacatgctc cagagcccct 

65761 gatttggaca tcatctggaa gcctggcatt gctccccgga aggcctcagg tacctcacac 

65821 tcagaaggtc cagggcctgc tgcctcctct gcccgcacgt gcacctccgg ctgcactgcc 

65881 agctgcatcg tccacctagg gagtggggcc cgcatgcagc caggtccatg ccaggagcag 

65941 aacctgcctc tccccctccc tcaaaggcag ctccctcctc actcctcagg gtctctggaa 

66001 cctacccctc cacgctctat gccctctgta ctgcccacgc tcctgactcc agcagcatct 

66061 gcgcctcccc catctctgca ctcttcaaat ccaagttcca aggcatttct ttacctgcca 
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66121 acctcagttt cccttgttgt ggccttcacg tcctttgatg ttgagccctg cgttcatgca 

66181 gggtgggcgt gggccctgca ggggtgaggt cctggtcccc tgggaagagc ttgcagccaa 

66241 cacaagcggg aggtctcaga tgctgcccga ccctgtgcaa atgaggccct ctcctacccg 

66301 gccatgagcc acgtgcaggg actcagcacc tgcctctgct ccctcctcct tcctgacccc 

66361 tctaggtcag gagtccaggc acacagcagt gYtggggtgc tgcagccaag cacaggctct 

66421 tcctagctga ttgtgggacc tggggcagcc ctgcacggtg ggccacactg tgccaccaag 

66481 ctgccttcct aggaggaaga gttcctcaga tgcccaggca ccagggactt cacacacctc 

66541 ctgtttcttc ttcatggtag ccggtggagt gggcacggga cctctcttac gcaaatgagg 

66601 gaacaagccc tgggcccagg caacaggcac ccccgcgtga ccccagcatc tggctctgtc 

66661 ccccgcatcc cgagcttcca atggcccatg gtggtgattg cacctggtcc agcagccaaR 

66721 agggagctga aagtaccagg gccctggccc agctgaggac cttgctgcca tcacccacca 

66781 tgacccttgg gcacaaagac cccccacttt cagtcgtggc accaagcaca gtccatgtct 

66841 gaccaggacc tgctcaagga gcaggagtga ccgaggacaa ccccggggac caccccagcc 

66901 ctcccccatg ctgggggtgg aagggtggtg aggcctgggg cctcttgatg ctcccatgcc 

66961 ctgatgtccc atcctccagg acgctcccat gcccctgcgc ctcctgaggg ggcagggaaa 

67021 ccccatgtgg gcgggcagca gccaggctcc ctggatctgc ccagagaaga gccccccgca 

67081 gaggtgatgc aatgctcacc acgcccccct cagctcaaga ccctggaact tgccagaccc 

67141 tcagccctct ctcccagtga gtgtcaggga cctccctttc ccctcaggct gctgggcccc 

67201 caaaccacat gtcctccttc cctctgctcc agccccccat ggggctgagg agcaggcctg 

67261 cccaaccttt gtttctggtc tctcctgtcc ttcacgcctg gctgtgggat cttctctcag 

67321 acctgcctct cccacgggta aatcccaagg tgcagactca gagacagcac cgagaggccc 

673 81 ccagtgcagc cgccactctg agcccagagg ccaggccatc caaccctgca gggacggggg 

67441 ttcctcccat ttgtcttcct gtgtgaactc cgtgggtcca gaagcctgag tttgggggtg 

67501 gtgacaattg gccctggtgg accccacaca ctcctgtccc atgatgcttc ctgccctaga 

67561 atatcccagc tgcagttgag gagggaaggg gctcctcata ctccccacac agcacccaga 

67621 ccacggtgtt gacgaggctt aatttgatca gccctatttc gagaagggag gaaatcagaa 

67681 atcagagtga gtgctgagtc ctgaaaattt aatccccaag gcagcctcac agggcctttc 

67741 ctccccaacc ggggcaggtg gcagggatgc actacaagcc cagatcccag ggccaccctg 

67801 agaaccagga ggggagggag ggccaccctg cctgcagaga ggatggcaaa ggccctgctt 

67861 gccgggcccc agctgccctc aacactgcaa gggctcagaa gactggacca gcagctggcc 

67921 gggcagaaac tatagaaatc atcctaattc tactttaaaa ggctggggtt tcctcttctt 

67981 gcttaaaagc tggtgtgacc cttccaggta gcagccagcg gcaggcctgt ggctcactga 

68041 gatgggctgg gtgggtggcc ctctggcctg tgctcccaac cagctgccca tgtgctcctg 

68101 cctgccctgc agaggtgacg tggctgccat cacagcattg catgttgtga actgactcct 

68161 gccaagccct gagagggcat caagtctggc agcagcccct gcagggccca gctgcctccg 

68221 aacctccagc atgcttctcc cagggcagga gcaccctccc cagagcaccc agtcctgttg 

68281 ccactttctg ccaggcaacc cctccccctg cttccaggtc cagcctgact ttcctcctgt 

68341 ttgcaacagt tcctacctgc ctgccctggg cccatgccct ccaccaggac aagtcactgt 

68401 ctccttctct ggctcccaaa accttcattt tcattcattt gttcattgtt cattcactcg 

68461 tcaagtattt tgctaactac catttgccag ccttgcgtgg caccacctct aggagagcgc 

68521 tctggggtgt caccatcaca ggctggcctg ctcagtgatc actcctcaga cacatgcaca 

68581 cacacacatg cacgcagtca catacacaca tgcacactca cacgaatgca ctcatacaca 

68641 cacacacgaa cgcactcaca tacacacatg cacactcaca caaatgcact catatacaca 

68701 cgaatgcact cagacactca caaacacgca cacacgaatg cactcacaca cacgcatgca 

68761 gtcacataca cacatgcact cacacgaacg cactcacaca cgcacactca cacacgaatg 

68821 cactcacaca caaatggact cacactcaca tgcacactca tacgaatgca ctcacacaca 

68881 tgcactcaca ctcacacgaa tgcactcaca cacacgcgaa tgcactcgca cacgcatgca 

68941 ctcacgcgtg aatgcactcg catactcaca tgcacactca catgaatgca cacacacata 

69001 cactcacaga cgcacactca cacatgaatg cactcacaca cacgaatgct cttacatgca 

69061 cacaaacgca catgcataca ttcatcacat gtgagtgtat gtgagtgcat gtgcgtgtat 

69121 gtgagtctat ggatgtgtgt gcatgtcaca cgtgtgtgca tatgtgacat gcacacWcag 

69181 gcacacacat gcacacacac aaacgtgcac atgcacacac aggctctaca gtgaactcct 

69241 ggaagactga ggtggccctt gacccctctc taagtcccca ggtacccaca tggggcttgg 

69301 caaaatgaat aaataaacgg gccaaaaacg cctatKggat aaatggaaaa cttagttatc 

69361 cttatgctac ttctgagcag gtggagacag gacagggtag gacgaggcgc agcgtggatg 

69421 cgggcggatt gtctgtatct ccttggtggg agcagttggc cgtggagtct acatcggttt 

69481 ccttcctggc cgtcatttac agggagagca tgagggcggc gtgaggtcag cggggcaggt 

69541 ggatgctgtg cgcttactgc agccatgcgg tgggtgcacg ctaccaccgt cttctcctct 

69601 ttttcttggg agctgtcgcc atggctcccg gcacaYcctg acatccgtag gcgccatcct 

69661 gacaacacgc tgccggccgc aggcctcaca ccacagcaat tctcactcgg gcctccctga 

69721 ccctcgacct gtgggtccct gctaggctgg gcgtgtttcc cgatgcccca tctctctgac 

69781 ctagacatat tttgttcaca tgttgaacag agagaaacgg tcYgcatttc cagaaagaaa 

69841 tatactctcc caattttcct tggagcatgc ggccctcttg gcttgtcttt tgcttattcc 

69901 ctcttcactg agaaatattt tgctcccRtg agaaaagcca tgtgggcagg gggatgagag 
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69961 ggcacactgc ctccagccca aggtggccag tggtgactag gagggtgcag gtccatgttg 

70021 ctgggtctta tttttgtaag gaagtgagaa atctggattt gcatgtgaag tctctgatct 

70081 ctaaatactg gcaaaaagca atttaagaat aatgtaggtg ctgtgctgct tgtgaaatga 

70141 accaaagctg ggtccccagg tctgtctgcc acctcagctc cacgcccaga ccaaagctgt 

70201 gtcctcaggt ctgtctgcca cctcggctcc acgccctgag caaccgtttc cctgtatcct 

70261 cctccgtctg gcctggagag gttttgagca cagtaggttc tggccaacca gagtgagtcc 

70321 acctgcctct gcctcacagt ggctgaagct Ccccgggccg cagttcacgc cccacttgtt 

70381 tctccacatc cctttcccac cttgaaggac agctcctgat ggatgtgttt tatggtggaa 

70441 tcctcacgca ggggcctgag gaacactccc tggttatcca cacggcaact aacactcagc 

70501 acgcaccagg ctcctcctcc ctctcatgga ctctcaattc agccccRcac agccaagggc 

70561 tcacccacat catgtgcgct tcagacaaag gactaatttc cttaatacat aaagcgctct 

70621 tacaaatcaa caRgaaacaa tccagcaatc caaaagggaa atggacaaac ggtattgata 

70681 caaaagtgct gggtagagaa gggcatgctc cctttaaatg atatggaagc agggaagtgc 

70741 atggtccctg gctaagggct ccaccctcac ggacctatgt gaggacaggc actcttgcct 

70801 ttgtgcccaa atgtcacatt tcccaagacc accctggtcc accatgtccc catcctgtgc 

70861 ctataaaaac ccgagaccct aacaggcaga cacaggcggc tggacgtcga gaggagcgca 

70921 tgaggggagg aacacacagg cggctggacg tcgagaggaa cgcaccaacg agcaccagca 

70981 cgctgcaggc caccgaccgg cagaagcaga acgacacgga gtttggccgg ggcagtcaga 

71041 ggagagccca ggctgcttag cggcccgact ccagggaaag ccctttgcac cccatcctcc 

71101 ttctggcttc ccccatctgc tgagagctac ttccactcaa taaaacctcg cactcattct 

71161 ccaagcccac gtgtgatctg attcttccag gacatcaagg aagaacccga gatacagaaa 

71221 gccctctgtc cttgcgacaa ggtagagggt ctaactgagc tggttaacac aagccgccta 

71281 cagacagcta aagtaaaaga gcacatgcag cacacgccca ctggggcttc aggggctgca 

71341 aacattcacc cctagacact gctgtggggt cggagcccca cagcctgccc gtctgtattc 

71401 tcccccagag gtttgagcag agggacactg aagaaggaag ccacaccccc atagcacgcc 

71461 ctgcgaggag gacaagagaa cttctctcat ttcaatatga ggggacaatt gtcagaaaag 

71521 aaaagacagc agcttttctt ttcttcttta tctctttctt tctttccctt ccttctttcc 

71581 ttccttcctt cctttctttc tttcttctct ttcttctttc cttctttctt tttttttttt 

71641 gagacagagt ctctgtcacc caggctggag tacagtggcg tgatctcagc tcactgcaac 

71701 caccgcctcc cggattcaag cagttttcct gcctcagcct tccaagcagc tgggattaca 

71761 ggtgcctgct atcatacctg gttaattttt tttttttttt gtatKtttag tagagacagg 

71821 atttcaccat gttagccagg ctggtttcaa actcctgacc tcaggtgatc catccaccct 

71881 ggcctcccac agtgttggta ttacgggtat gagccaccga gcagcttttc agtgtatgag 

71941 aagacgctcg acctcaccct taattaaaga aacaacctgc atcagattgt ccaaataaaa 

72001 atgtacggat caggggtaca tgctgagaga aacggatcca gccaaccatc ctagggttgt 

72061 ggtcatcctt atcaaagttg aaaatgcaca aaccttttga atccacaatt ctagttatca 

72121 aatttaatcc tacaaatgtc ctcatacaag tagccaagga aataagtgta aggatatcca 

72181 Sggctgcatt ataagaaacR gtgaaaagaa gtaacgtccc catcagtcaa ggtctggtta 

72241 agcaagtgat tcacaggatg gaatgcctga gcaggaggaa tgggcaggtc tggaggggct 

72301 gctgtggacg acgtccaaga catgttccct gataatcgca aatgcagggt agcacacggt 

72361 gtagtctaca gtgtgcagga aaaagtcaga aaaacgaaaa cacagtatct acaggacaca 

72421 aaatcccttg agccagagat tctaatccta ggatttatat atgtgtgtgt acaNatgtgt 

72481 atatatatat acttgtgtgt gcacaggaca tcctgaaagg acacagaaga aaggtgtgca 

72541 gtggctgccc tgcagaaagg agctgaSttc caggacagcc aatgggaggg ggatgaggac 

72601 agcaatgagc atttcatatg gctgcaatcc ttagtccagc caaaaagcaa gccacgcaca 

72661 tgcaggagag gccaggactc tcccagcagc tcccccaggg gtcaggggca gagggagccc 

72721 ccagcctggc ctgctccctg acgactctac acttggcagc tgaagggggc caccctgaca 

72781 gtcaaggccc agctgtgatc tggggtggaa ctggtctggc accgatctgc ttggcctggc 

72841 tctgggtgct ggcacggcag tatctggtgg gctgcagggc ccccagctgg ggccctttga 

72901 gtgatgatga gcctgctggg ctgggaccac agagtgagca ggaccagcat cccctcaacc 

72 961 ccagatgccc tcYaggactg cagggcccag acacagagag ggaacctgac gctcatgggg 

73021 agggctgaga ggcaggcaga gggggaaggg tgttcctgac cacggcccac aggaccagca 

73081 gtgcagaggg cctgggagtg agggaagccc ctgctgtgcc aaggagggag ggccaggaga 

73141 ggagagatgg ccgaggccgg gaggagggca gaagtgcaga gagactagag ggggcagggc 

73201 agccctggct tggggctgga gagggagcca ggaaggaggg tggaaggcag gggaggcagc 

73261 gtgtggacaa agactgagag ggggagtcac agcttgggtg cagaaagtgg ggccagggag 

73321 gattccgggt ttctgacctg ggcaaccggt caccctggga agccttccct gcaagggacc 

73381 ctcgggagga gccgcgtggg aggaggagaa ggccgtgtgg ggcgtgttgg gctgagattt 

73441 ctgtggggac acaaggacag acacccaYga gacactcagg caggagtggt ctggagcaca 

73501 gagagagggt gggccggagg ccccaggtgt gggcggcctc caggatgaat cagctgcaga 

73561 gcaccgcagt ggcgcacaca gatcagtgtc ctgatctgaa aagtaggcat aacgatgctg 

73621 gccacagccc acaagtaaga cacatagaaa acctgaggcc cacagaggtg acaggaacgg 

73681 gccaggtgca cggacagggc tgttcctgac tcagctcact ccaccagaac ccagtctggg 

73741 caccagcctg ttttcttgtc ctctccagcc agcagggcag aattgggggt ggacttgccc 



119 



PATENT 

Docket 524593008900 



73801 cagcctgaag acctccgggg ccctggctcc tccctgctcc acctgccata gcccaaaacc 

73861 agtgtgggtg tgaagcggaa ggagactcYg cagcaaggcc cagctcttat gccggccact 

73921 gccttgcctc ctgcctgatg cagtctctcg gcagcaaggc aggtgccctg agacctggag 

73981 gtgcccccag ccccagaacc caccatgcgc ggctgggccg actgtgcctg cccttaaggc 

74041 aggcagaaca ctaaccaaag ccagcggaca tccactgcct gcggagcatc tggggaaggc 

74101 tcctgtgggg ctccagaccc ggggcggtgc cttcaccacg gcctcctcat ccagcatcca 

74161 caactctgga gctaggaagc ccggtgctcc ttcccagagg agggagccga agctctgcca 

74221 gtctgcactg aagtctgtgg tgtgcggtca cctgagaaca gctcagtcag tctggctgga 

74281 tcctgagctc ctcccccggc ctgtgcttga gaggtgacag ccagcacgtg agcaagtgcc 

74341 ctgggttctg gtcccagcaa agctacctcc agctgtgcac ctcggtcagg cacctccagc 

744 01 tgtgcacctc ggtcaggcca ccctctctcc gtcagaccct cagtgtcctg atcagtgtcc 

744 61 acaagtaaga cacatagaaa acctgaggcc cagagaagcc aaggaagtcc ttacaggctt 

74521 ctgtcagaga atcggaggaa aggaggggac gccagagaat ctgggcttaa gtgacttggc 

74581 cgtgatcaaa acacaggcag agtcacaaac acgagacccc gctgcccact cccgacggag 

74641 aggtaattcc acaaacacga gaccccgctg cccactccac aaacatgaga ccccgctgcc 

74701 cactcccgac ggagaggcaa ttccacaaac acgagacccc gctgcccact ccacaaacac 

74761 gagaccccgc tgcccactcc cgacggagag gtaattccac aaacacgaga ccctgctgcc 

74821 cactccacaa acatgagacc ccgctgccca ttcccgacgg agaggtaatt ccaggtcagt 

74881 ggcatccagc cttccccggc ccccaggccc ccagctgcca ccaagaaaca ctctttgtag 

74941 gggtcctgcc ccactggggt gcagaggtac atcccccaag gcccagcccc gcaggaagcc 

75001 cttcctctgc agccattctt gaaatcacct ccaagaaaaa agaaatgctt cagcttccaa 

75061 agatatcccc aggcaggaaa tgggccgcag gcagcatgaa tgtttaatta cgttaattgt 

75121 ccaagtccag cgcacatcat ccctcctaag taccttatgg gaaatcgtaa ttaggcgtga 

75181 aatggctcat tttcacaact cccctttctc ggggttctcc agggcgcatg cccctgtcag 

75241 ctggccggac aggcctctct tgccaaggct taccctcctt ggttctaatt taattttgtg 

75301 accttttctc tcgacatcct gaccccgtac atccctgcgc cttggcacac tgcaccaaag 

75361 caccatagaa ttactgagct ataagcctct catggaaatc tgtttttgac tttgttagaa 

75421 aacactataa tcaaatccct gagagtcagg agggatgggc ttggcctgcc agaggaaaca 

754 81 tatcaccgga ttaagtgatt tagcagaggt ggtcccggcg ggaggagggc agggcaggct 

75541 gggcagggct gggcgtgagg tgggcaggca gggtgccctc gggggcaagg ggggagggga 

75601 tgcgcagagg cagaggatgg gcaatgtggg attaccagaa agtgcggtct ggacatccca 

75661 gggcctgcag gcttcctgag tgcctacctg gccctgggca ctcacagggc attgatgccg 

75721 agatgctcaR ctgtgcagag ggtgtgcagg tctggtggtc acttatgcgc tccagggcct 

75781 gcttctgctg atgccagggc aagacccagc ctcaaacaca catcaaatac atacgcttga 

75841 cgttatgccc gggagggctg tgcacacagg cacacagaga cggtgagact tggggatact 

75901 tgggtccagt tactgtcacc ttgggtctca cttctgggag cgtggatttg gctggcacca 

75961 ctgcactgcR gtgtgtctgc gtctatacag acttcctcca tttcacgcat gtacctaaaa 

76021 cataccacca ttcatgtact gtcaggcact ctttcatttt gtaaaacaat agggttatca 

76081 aggcctgcgc agtggctcat gcttctaatc ccaRcacttt gggaggccga ggcaggcgga 

76141 ttgcctgagg tcaggagttc gagaccaggc tggccaacat ggtgaaaccc catttctact 

76201 aaaaatacaa aaattagccg ggcatggtgg tgggcgcctg taattgcagc tactcaggag 

76261 gctgaggcag gagaatggct tgaacccggg aggcggaggt tgcagtgagc cgagatcgtg 

76321 ccactgcact ccagcctggg cRacagagcg aaactcagtc tcaaaaaaaa aaagccgggg 

76381 gggctaccaa aagatgagag ggacccaggc ctgtattgca catccacatc agtgacagtg 

76441 taatggtaYg cagaccccct ccccRcatgc acacgcattc tgaaataagc caggcccaga 

76501 aggacaaata cggcactcgt ccactcacat gaggatctgt aatcaaacgc atgaaagcag 

76561 atgacaccat agaggctgcc agagctgggg gagcgggaaa tggggagttg ctgttcaatg 

76621 ggtgtaaagt tttagtgatg ctggacgagt aagtactaga ggtctgctgt acaacgttgt 

76681 gcctgtagct gaccatgtgg cactgtgcac ttcgaaattc tctgagaggt ctgagctcac 

76741 gttaagtgtt ctttccacaa aagcaaacaa caacagcaac agaaatgaag Kgacagaagg 

76801 aaaccgtggg agaagacgag tgtctaccgc ctcgatgtgc tgatggcgtc ccggtgtctg 

76861 cacaaatcca cacttgctga actgtaaggg gccggtgacc tctacagtcg cctaccatgg 

76921 cgagcataaa cttgcagaga agaggatgag aattctgata ataggacaaa tcaaagctta 

76981 acaagtaatt aaaggacaaa ttggtgcata taacaagtcg ttccctgtct cgcagctgac 

77041 atggatcaat gtccaaaatt tttatccaaa ttgttataaa tacacatgag agtcttctcg 

77101 gtgcagtgaa tagtgaagtc agcagaggca gtggctcacc cctgcagtct cagcacttta 

77161 ggaggccgag ggggtaggat cacttgagta ccagagttag aggctgcagt gagccatgat 

77221 tgtgccactg cactccagct ggggccacag agcgagaccc tgtctctagg aaaaaaaaaa 

77281 agaaaaaaga aaaagaaaaa aaaaaagagg aaattagcca atatcccaga gagtgaaggc 

77341 agaacccgta tcagcttttg ccaaaaatga gaactccaca gttgcctccg cattctgtct 

77401 tttaaaaagt gtcaatttta cattatgaaa tagagtgcat ctttttaaag taaaatcgat 

77461 gctccaggaa gatattccac gtttatcctg gactttggtg ccctctcgac acttggtttt 

77521 gattaaatta atccaaactc cagtttgcta agccagtgca cctgtgtgcg gctgttacat 

77581 tgcaccaatg ttttatcaaa tctccgacta tgcccaagtg cggcgccatg gaagccacac 
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77641 


gggatccctg 


acagacagaa 


acatccatac 


tgctcttaag 


gtggctcagg 


cactgttcca 


77701 


agcacctcgt 


ctatagttgc 


ccactgattc 


ttcgcagcag 


ctgtgtgagt 


tagctgtgat 


77761 


gcttatcttc 


attctgcaga 


tggggaaact 


gagaaaggct 


atcccaattt 


cacactgctc 


77821 


ggaagtgtca 


gaaccaggat 


atgaaccctg 


gcatggctac 


caagagtctg 


tgctcttgaa 


77881 


cgctgtgcta 


ggccactcct 


tctggggctg 


ttacatttta 


actctgacat 


gtcaagcaaa 


77941 


tcactataag 


tttattgagt 


catttcttag 


ctgcttcatc 


tgtaaagtcg 


ggtgacaatg 


78001 


gctgcaggtc 


ctgccactca 


gacgattccc 


acgcgtgtct 


gYatcaggat 


gaggatgagt 


78061 


agatcggtaa 


agtgccttgc 


agactgtgat 


gatcatattt 


ttgtctt tga 


ttccactgcc 


78121 


agcacaatac 


cttttttttt 


ttaataaagg 


gctgttgaac 


tgcaattccc 


atgtttgtat 


78181 


acattcttct 


tctgtttctt 


tggttaaagt 


ttcaagattc 


tcttagggtt 


caaatgttta 


78241 


aatttcaaat 


cctttggctc 


atctgactaa 


ttacaagcat 


aaaatatggc 


atttccaaca 


78301 


agtctggaga 


acggaagcac 


atgaaattat 


gcacgaccat 


gagcatctgt 


gtcgggaggc 


78361 


gggaaggcgt 


tcacccttca 


aactcaagtg 


aaaaaggttt 


gccgtagtta 


aaggctgctg 


78421 


acaaataact 


actgttctga 


gaaatcacca 


ctgtttactg 


ggagaatcgt 


ttttcagcaa 


78481 


atatatcaag 


aaaagtaaat 


atcaagtata 


atttgcagag 


ttgcagataa 


ctccacattt 


78541 


tggaataaaa 


aacttctgtg 


tctgtgtaca 


agtgtgcaaa 


acacagcaga 


gacatttttc 


78601 


tttccaatcc 


tgcctaaata 


ctgactgaac 


atcatcagtg 


ccatctccca 


tgactgtgac 


78661 


agtgtatctt 


gtgtgtatgt 


gttgtatcat 


attatctgct 


ttgttaaatg 


ctatgcttgt 


78721 


taaaatgtgt 


cacattaaaa 


aacaaaaaca 


gcccgggcat 


ggtggctcac 


gcctgtaatc 


78781 


ccagcacttt 


gaaaggctga 


gggtggggga 


gggtggatca 


cgaggtcagg 


agttcaagac 


78841 


cagcctggcc 


aacatggcaa 


aaccctgtct 


cctctaaaaa 


atacaaaaat 


tagccaggcg 


78901 


tgttggcagg 


cacctgtaat 


cccagctact 


cgggaggctg 


gaggcaggag 


aatcgcttga 


78961 


acccggaggg 


agaggttgcg 


gtgagaggag 


attgtaccat 


tgcactccag 


cctgagtgat 


79021 


agagtgagac 


tccgtttcaa 


aacaaaacaa 


aatgaacaaa 


caaacaaaac 


aaacaaaaac 


79081 


aaaaaaacag 


aaacacacac 


tacactgact 


acactgaagc 


cctcaagccg 


ctgagcagcc 


79141 


atctccaggt 


ccccaaggac 


caggcaccaa 


ggtctggggg 


tggcccagca 


tccattctcc 


79201 


cctttcccag 


agcagcagcc 


cagtgttatt 


tcctggaatt 


cctggaaccc 


tgcatgcaca 


79261 


cagccctccc 


caagccaaac 


atcctctccc 


tcacagagtg 


tgggatttga 


gccgcaggag 


79321 


ccagaagagg 


aaaaggtagt 


ggtgtcacta 


aacacctggt 


cacaatggtt 


aagaccctcc 


79381 


tgtgtccagg 


accccgactg 


ccaggccctt 


ccacagcccg 


acLccccgtg 


gactctgcag 


79441 


gtttcctaat 


gtcctcccag 


taactcttcc 


tggctcaggg 


cagctggact 


tcatgcaagc 


79501 


tctctgccag 


aagagaccct 


ctcctgccac 


aggcgccctc 


cccatcagat 


gccagagata 


79561 


tcaccagccc 


tcgaggctta 


acccctgctc 


tgtgggggat 


tcctgctgct 


ctgcctgaat 


79621 


tcacccaggt 


gggcagtcct 


caggtgactt 


ctccagctct 


gcatggagaa 


ggcctcacca 


79681 


tgcatctcag 


agagaagtgg 


aggctgcacc 


cccacaggaa 


gtggggccca 


gtccaccgtg 


79741 


ccctggacgc 


tgctcatcca 


ctaggactgg 


gcttccgtgt 


gcagggcttc 


accctcgctc 


79801 


tcctctctgc 


ctgctccaca 


cctgctgctt 


tgacctccat 


gaccatggac 


ctgctgacag 


79861 


gtgccccccg 


acccacctcc 


agcctgcacc 


tccccaggct 


ccagaccccg 


ccccctgctg 


79921 


cctgcaactg 


cccctggggt 


gtcacaaggc 


accttcatgc 


caacaagtcc 


caaacaaacc 


79981 


tgtcaccctt 


ccctacccca 


acccatcagc 


cattgctgcc 


gtgcccatct 


catctccaca 


80041 


tccaaggagt 


caccgcatcc 


tggcaccttc 


cccatctccc 


cgtccatgat 


gagctcagcc 


80101 


cctccctcta 


cctggatgca 


gcagcctcct 


cccagacctg 


gccccatccg 


ctgctgctcc 


80161 


ctgcccttca 


ctccctgtag 


ctcgagtcct 


cacctgagct 


tgcccccatt 


aaaatcctct 


80221 


gtacctccct 


ctccccaagc 


ctagcagacc 


cgtgcgggag 


tgcgagcaca 


ctggctaggg 


80281 


cctcctgatg 


ggcctgacct 


gcctgccaac 


cctcccaggg 


acctgctcat 


gttcccgcaa 


80341 


cgcccttaca 


gccatgggtc 


cgagaaaaca 


ctgctcactc 


tgcccagaat 


gcctgccctg 


80401 


ccctttctgg 


cagactccta 


ttcagtcttc 


aaaacccagc 


tcccagctcg 


ttaggcccgc 


80461 


aaagagagca 


agttactctc 


tctcctagac 


tcaagcaccc 


tgtgcttcct 


tcttttaaac 


80521 


cactgatcat 


gcagaagtat 


aagctcacca 


gtttccaagt 


cccctaacca 


gaccatgagc 


80581 


tccaggggta 


aaatcatgca 


tcagattcat 


ctctgggtca 


cctgccttgg 


ctgagaggag 


80641 


gctcatgaag 


gtttggcaca 


atgggatgaa 


ggaatagaca 


aggagcaaga 


atacgagctc 


80701 


atcctcatgc 


ctcagaatag 


catgaaatgc 


tcagggtgtg 


tgggacagaa 


ctgacgcYtt 


80761 


cacaccaatg 


gagccagRtg 


ccagcctatg 


tgtgtccaag 


tcctaccaag 


tcccaggctg 




catggaaatc 


agagggaccc 


caaatcgaat 


gagcgagggg 


cccttaaaga 


aactcacctg 


80881 






a 999gttggt 








80941 


tgttcccacc 


tagtcccaag 


catctgagca 


gggccaacga 


aagtgctggt 


aggccaggcc 


81001 


ttgccctgga 


gaggaagggc 


agagaggaag 


gagaggggaa 


gcggggtgga 


ccacctctgc 


81061 


tcagaccaaa 


ggcagaatcc 


agtgctgatg 


gttgccaata 


gggtcctctc 


tgttgctggg 


81121 


cccccatgaa 


ccatgtgagc 


cctgcacaga 


gctcctgcta 


gatcctgggg 


gtctccctgc 


81181 


actgggattg 


ctccaacagt 


gcccccaccc 


tcagggcatg 


ggttgccctc 


tgcatctggc 


81241 


caagtgcctt 


cccagacacc 


ctgcagcctc 


agcctcctgt 


gaccactcat 


gtccacctag 


81301 


cctggtcctg 


gtcccaggtg 


ccctcctggt 


ctagctgcca 


ctaaacagat 


gagaaKctcc 


81361 


ctcccatccc 


agccctgaaa 


tatgctcagc 


ctcaaagcct 


cttggcacat 


ctcagcttcc 


81421 


agatgtcctg 


tgccttgtga 


gatattaacc 


acactctgcc 


tagccctctg 


ccatgaggtc 
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814 81 agctccgctc ccaggcaggg ggaagagctg ccttgggagc aatgatgggc atcgtccctc 

81541 tgtgctgtgt ttggggtctt ccctccctcc aagtcYtgga ctttgggccc agaaatcact 

816 01 ttccctccct cctctctgag cacacaaagg gcaggacttg tcttcaggtg cttggggaag 

81661 agggacgcca attgttccag ccactcccRt aagggtgaac caggagacta accatcaaat 

81721 tcaccaggaa atcccaacct ccaggccctg actgaggcMc taagacccaa gcaagcaata 

81781 tcaatgaaag ccagaggtgc ttaaggtggg gaacaagaat gagctgtcag aggcctctga 

81841 taaagaatcg tttaaaaaga ctttctttgc tgcggctcat gcctgtaatc ccagcccttt 

81901 ggaagtctga ggcggtagga tcacttgagg tcaagaattc aagagcagcS tgggcaatat 

81961 ggtaaaaccc tgtctccata cacataaaaa aaattagcca aacgtaatgg ggtgcacctg 

82021 cagtcccagc tactcgggag gctgaggtgg gaggactgac tgaccctgag aggttgaggc 

82081 tgcagtaagc cgagtccatg ctgctgcact tcagcctggg tgacaaagca agacctggtt 

82141 tcagaaaaaa gaatttcccc cttcgatgta tgctatggac tgaatatttg tacccccaaa 

82201 attcatatgt tgaaacctaa tccccagagg gatgatttta ggaggtgggg cctttggggg 

82261 gtaattaggt catgagggtg gggcctcatg aatgggatta gtgcccttat caaagaagcc 

82321 ccatggccag gtgcagtggc ttacaccagt aatctcagca ctttgggggg ttgagacggg 

82381 aggatagctt gagcacagaa gtacaagacc agcttggtta acacagcaag ccctcatcac 

82441 tacaaaaact aaaaataaaa aaaaatatgt tgcatgtggt ggctcacatc tctagcccca 

82501 gctacgtgag aggctgaggc aggaggatcg cttgatccca ggagttcaag gacacagtga 

82561 gMtaggatcg tgccactgca ctccagccta ggtgatgagt gagactctgt ctctaaacat 

82621 ttttttaaat taaaaatgta gaaataacat aaaagaggcc ccagagagct ctcttgcccc 

82681 ttgcaatgca tgaagacaaa gtgcaagaag gtgccatgtg tgaaccagaa ggtaggccct 

82741 cagcagacac aaaatctgcc agcactttga tcttggactt cccagcctcc agaaccatga 

82801 gaaagaaata aacactatgt aagccacctg gtgttggggg tgctttttgt ggtagcagcc 

82861 tctactgact aagacagtgt cacctcagct actctcaaga aaggcttcgt cacctggatc 

82921 cagtcgcgtc agttaagcgg tgctgcccag ctctgggagg ctgggggaga tgaggcgatg 

82981 tgtgattctc aaggtcagca tgttggaaag tctagcaggg tttaatgagg acggaattaa 

83041 aggagacggg gcctgatgag gggctctggg cctctctgtg tccctgtctg ctttgtctgg 

83101 aaaatggtga aaggaaggag gcatcttgga gcatctgtgt cctctggttt tccttgatcc 

83161 ctggattcac caggacacaa aaacccagtg cttcttgtac tgcaaggtgc ttgagaaagg 

83221 aagagcaaac ccacaagctg aagagtccca ggcagccatc gctcccgaca cgcacgccgc 

83281 agtgtagctg tggctcctga gaacggcggt gttaaaggac agttacttaa ccagaacttg 

83341 gggtccccct gccccgcccc acggaggcct tgaacatcct acaataaaca cacaccgaag 

83401 ccatgttctt ggagagactt tggggaggct gtgcactctg gaaacagcca agctgttctg 

83461 tggcttggaa gaaacaagca gttccatcca ctctgggctc acagatattc ctcctgagct 

83521 tgaaggaagc cacagcggaa cttggagctg ctctagttct ctggaacatg cagcaggtgg 

83581 aaccgcagac Rtctctccag ggagctgagt cccggttttc acctgctggg tgtccacgga 

83641 ttcaccccca gctactgtgt aggggccagc tcccctcagg gactgctcct gggagaccgY 

83701 gggagtcacc gcagctccct gctctgtgga gcggccacgc tagcagaagg aacagagggt 

83761 gccagtgctc tgagtatgcc ggagggcagt gagtgctgcg ggaaggggct ggagggcaca 

83821 Sggtctgcga gtgccagggc aSgtggtgcg gggggagctg tctacaaagc cttagagcgc 

83881 cagggtaggt cccactggaa gggaacatgc aagcaaagac tcKaatgaKg tgggagggtg 

83941 agctgagggt gtctagaaag ggcatccagg cggaaggagg accccatgtg gggttttgaa 

84001 aagaagggca aggaggccca Sgatggcagg aatcaagtga gtgagaagga aaatggaggc 

84061 cacggggtcc cagggcgagg cgctgggtcg tgtggggtct ggtaggtggc ccgggtgctg 

84121 ggcgctgact tctgagctga acgaagcagg agccactgtg ggatttccag cagaRgacag 

84181 acatgatcca gcagaaggca tcaaaggatc cctgaccacg gaccgcccca ggacccttcc 

84241 accatagtgc aggattcctc ttacaggaaa tgccccagta ggcagatcta gagaggcagg 

84 301 aagtggatga atggttgcct ggaaggtggt gggggaggaa ggacgacagc taaggggtgt 

84361 ggggtctttg gggagggaga agatgttgtg aatataataa aaaccattga atcgtaYact 

84421 ttaaatgggt gacctgtata gtatgtgaat tatatctcaa taaagctgtt agagaaaaga 

84481 atccctgctg ctgtgttgag gacacatggg ggtggggagg cagggaggtc tgctgggggc 

84541 tcccaaagtc atcagaagag gacaaaggct tgacgcaagt gacaaccgtg gagatggtga 

84601 gaagtgatca gggtcggggt ggattttgaa agaaaaccta aaaagatttc tcaacaggtt 

84661 tctcatcagt gtgagaggaa gagaggggtc aaccctaact gcagagcttt ggctgtgcaa 

84721 atggaaggaa ggatcagccg tgcacaSagc tggggaaggc cgagggtgga gtgcggagag 

84781 ggtgagcgcg agttcagtga ggggcgtgct gaggtggtac ttccaaattg aggttggcaa 

84841 gtagagaaat gccttggaac ttggatttga agagtgcaga gacagatgag gtgcagattc 

84901 tgctggagtc aggagcgtgg gcaatacaga gagcatggaa ggtggagggg cagttggaat 

84961' atgagtcttg ggtgcaggag agagatctgg gctggagacg caaatctggg agccatttca 

85021 tggtacataa agcaacaaga ggaatgagat cgcccacaga ggggtgcaga gggagaccaa 

85081 ggagcgtcca ggacagagcc ctgggcgcat caaccccaga gggtgcggga agaggaagga 

85141 tcagcaaagg aaacggggat gaactgtgag tgctgtgatt agaccaagaa agtgggtgtc 

85201 cggtaagcca caccaaggga gcaggaggga ctagcctgcc acgtgctagg gatggcgact 

85261 gaggattggc agggagatgc agcagcatga tgacctcggg gagggtgcct ctggcagagt 
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85381 
85441 
85501 
85561 
85621 
85681 
85741 
85801 
85861 
85921 
85981 
86041 
86101 
86161 
86221 
86281 
86341 
86401 
86461 
86521 
86581 
86641 
86701 
86761 



86941 
87001 
87061 
87121 
87181 
87241 
87301 
87361 
87421 
87481 
87541 
87601 
87661 
87721 
87781 
87841 
87901 



88141 
88201 
88261 
88321 
88381 
88441 
88501 
88561 



621 



88861 
88921 
88981 
89041 
89101 



ggcacacaca 
gatggcgtct 
agacgaggga 
ggacggaaga 
ctgctagcgg 
tttgttcagg 
agggcattgt 
taagaSaagg 
aaggcggcag 
cgggaaaata 
gctggcaaaa 
aaaccttgac 
gtcaaatgag 
agaaactctg 
acactgctgc 
ggttttgtcc 
tgccgcagaa 
aaagaggagg 
tttccctaat 
tgctggagaa 
gaaccttcga 
gtcactgtgt 

agcctctgtg 
gcagatcaac 
ctgctctttc 
tttgacgttc 
attaaaatca 
cctctttctt 
tgcaggaaaa 
atgtcgaaga 
tctctacgca 
tagccccttt 
taattcatgc 
gtactcaatg 
ctaatcatgg 
tcagtaagct 
gtctctgcag 
gtgtgggcca 
cttaattgga 
taccttcctt 
ccacttctgg 
actggtgaaa 
ctaagagcac 
gaagaaaaat 
tcttccccgc 
ggccaagaac 
caggatagat 
tccctgctga 
caggagctcc 
cagcctgctt 
ctacctcctg 
aagaggaggc 
ggaaaaaaaa 
tgaatttcaa 
tgacaattta 
aagcctagta 
aagggacagt 
gctcaaggct 
aaaacagtct 
agatggggga 
acaacagaaa 
agaaagccca 
ccattggaaa 



gaggcctggt 
ccacctaggg 
agccgtcagg 
tatgaaaagt 
caccttgctg 
ggtacccgtg 
tcagtagatg 
aggctgggta 
catcaatggc 
gaaggtcatc 
gaaggtttaa 
agctgtggtc 
ttcccgtgac 
cagaccccat 
ggaaaccata 
ggtaaaggga 
gacaacgtta 
aaactcaaat 
agatgggttt 
caatgattca 
gaaccggagg 
tgccagggac 
tcgcaggcaa 
aggcacacac 
ccagccgggc 
atgagtcagg 
aaaacgacat 
aaagagcaca 
ttacttgtgc 
gtgtagtcaa 
gatttacagt 
ttttttaaaa 
ccgccagcgg 
ccatSctact 
ccctcagcac 
gtcttccatg 
cacagcgtag 
aagagagcca 
gcggtgcctg 
tttggatctc 
attgctgtgg 
aatggtggtg 
agtattgaaa 

cccagtaaga 
ttctagtcct 
acagggctcc 
gggccaccag 
gtgtgaccga 
gtctcaggtg 
gctagaaaaa 
cccaatacct 
tgacagctct 
aaaaaaatca 
agactggcct 
tgtctcagga 
gctgggtgtg 
catagggagc 
gtgccctcca 



ccaagtgggg 
ggtgctgctg 
tgcgaatgcg 
catctggaag 
gtgaacatca 
ccagcatgaa 
ttacaaagca 
aggggtggag 
ctgcaggttt 
agaatgcaac 
ggtatgacca 
atggtcgacc 
gcgtgtcagt 
gccttgccRc 
aacgtcacca 
agaggaacag 
atgagcttgg 
ctcaggagag 
agctgcatgg 
agactacaaa 
ccatgccact 
agagtgcaag 
tgggaccctg 
tgccagagat 
ttccacgtcc 
gctttgggcc 
gaaaattgcc 
gccttcttga 



ggggcaagtg 
gaagggcctt 
ccccagtggc 
agtgattggt 
agacctgaac 
gcctggggct 
agtcctagga 
ctgtcttgct 



ggggaatcag 
cacaaatgac 
gatgtcagat 
tatgtctttg 



agaggagccc 
agtcaccatc 
aagcaacacc 
acagcaagaa 
tccagttcag 
ctcttccctc 
cgtttctcat 
Raggccggag 
cagcagggcg 
tattccatgc 
ggagtggtgg 
atgagaacaa 
ggagaacaga 

cattgttgaa 
ataccaaatg 
cctgctgaaa 
acagctgagg 
ggaaacaaat 
tatgcagagc 
tcattcacag 
gtaacagctg 
aaaaaaggaa 



ttaagacaca 
agagagcaaa 
gatagggagc 
gaggcaaaca 
tcgtaagcaa 
gtaggtagaa 
agagtgactc 
ggaaggatca 
gctggagcac 
actgcaacgg 
aggccaaatc 
tcatgggaaa 
gcgtgtggat 
cctcgaccct 
gattcagagc 
ggttcaactc 
cttaacttac 
tttaatggca 
gtgaggtaga 
gaaactgaag 
gccttgtttc 
gtgcctggaa 
caccagtggc 
gacctcacgc 
agcaaaacga 

atgtcaaagg 

gcacatgaag 
aaactatagg 
ggggaccatc 
gccccaccca 
tgtggtgggg 
ccagggggtg 
attgggagaa 
gcgtatgtga 
ggaggagaga 
cgcgtgtgcc 
ctgaatgacc 
aaagattagg 
catggacaca 



tcggtggcac 
ctcacgggga 
agatcccagt 



agagtgctgg 
tagggggagg 
cttcagtgaa 
caacctgaac 
tcgttatgag 
cctacccaaa 
aataatagag 
aagcagagag 
tcactgtcat 
cttcccactg 



tgccacatga 
aaaaaaaggc 
tcagattttg 
accatgctta 



gtgagaggag 
aagctgggag 
agatgttgga 
gactgagggg 
gagcagaagc 
agtttcattt 
caatgaggga 
gtggtgtgag 
aggtcggagc 
attatgcagg 
cagtggggag 
ttttgcaaac 
cttgctgttt 
gatacgcgat 
ttgagcccac 
tcctggcaga 
aagggatttc 
ggagtgagct 
ccaaggccct 
tgtgagcgga 
gggtggagga 
gctcctcggg 
cagtacctgc 
tggtggaaaa 
ggcgagttcg 
ccaaacagag 
ccattctgta 
tttgggcaga 
tttcaggaag 
tttcttttta 
caaaatagtg 
ttgttgattt 
atgtgtctct 
gggcacatag 
aacacatgac 
aaaaatattc 
agagccaatc 
cgctgagcct 
agcaaatYtc 
ctgacatatc 
aatMatctgS 
ttcccctcag 
taaagtctct 
gaaaatcaac 
ctgagccatc 
gttccactct 
caaggaatac 
tcccagtgac 
cacccagccc 
gctcctaatc 
caaaccaaga 
gactctggcg 
cacagaccag 
gatctcttcc 
tttaattgga 
caatcatccc 
tgtaacagac 
cccgggatga 
tgaagaatac 
acaactgcaa 
cccaaaatgt 
ggcaacagac 
cagcaaagat 
tgtaaaggag 



aggaatcgaa 
agtaggtgac 
ggtgttcaga 
agcagtctgg 
agggctgagc 
taccaggact 
acctgcagtt 
agaccgtgac 
aaaagggaac 
tataggggcc 
tcctttttgg 
ttgatcctgt 
gggggagggc 
gccaatagac 
ctctcccaga 
actctatgga 
cggtYttgtg 
tgagtgttgg 
ctgtagaata 
agataggagt 
acaaagcctt 
aacgttagct 
tgccccagcc 
cgaggagaca 
ggttgttatt 
tcaaaatcaa 
atttcagtgt 
attcaagttt 
tgaccatttt 
agttctgagt 
tgtgtaccta 
ttcacaccag 
ggtggatggg 
cccacacctg 
tggagtgaga 
ccttcctggc 
tgagaaggaa 
tggcagcctc 
cttttcttgg 
cagtgtcttc 
catgagaatt 
aacaaccata 
agaaattatc 
caaatcttgg 
acttgctctc 
gggggtgtgt 

agagggtaac 
taactcaggg 
accctaagtc 
agatcagagg 
gcaagaattt 
cttatttact 
agggggagaa 
tcaaactatg 
tcaattagtg 
agatcagaga 
ccctgcacac 
agactttgtg 
acagcctcag 
ccagtttcac 
actgctggta 
ttcaaagcag 
tgatgatgac 
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89161 


aatatctcat 


caaataaaqa 


ctatcaataa 


aqaqaaatta 


taaacaaqaa 


ccaaatqqaa 


89221 


attct gg a g t 


tgaaaagcac 


cataactqaq 


aggcttaaca 


qtaqattqqa 


actqqcaqaa 

g 


89281 


gaaagaatct 


tgaaaacaga 


tcaataqaca 


ttatgcaatt 


tqaa a 
accataaa 30 




89341 






aacttaaqaa 






a cat 3 aacat 

g 


89401 


atgcacaatg 


ggagtcccag 


aaaqaqqcaa 


gtgagtataa 


aqqaq 




89461 


aaacagtggc 


aaaaagctcc 


ctaaatttqq 


tgaaaacat t 


aacctacqtq 


tccqaqaaac 


89521 


tcagcaaact 




taaatqccaa 




cccaqaqaca 


tcataqtaaa 


89581 


aatgccgaaa 


qccaaaqaca 


aaqaqaaaat 


tctqaaaqca 


qcaaaaqaaa 


g g 


89641 




aaaatccc 




a cc a tt 
ctatcaac/ 


t t t 




89701 


caqcaqaatq 


qcatatccaa 


t t 




a^atctttc 


tcca^a^a 


89761 


ctatttttca 


aaattaaqqt 


qcaatataqa 


caqtcccaqa 


taaqtaaaaa 


c^a^aatta 


89821 


tqaqtaqacc 


cattttatca 


Lcatactaa 


aqqaaqttct 


tcaaqct aa 


t 


89881 


ctcaattaat 


aacttqaatt 


cacacccaca 


cacatac 


acca taaa 


^aatca^c 


89941 


aactttaaaa 


qacaatataa 


atqqqtattt 


cttctctttt 


cttcccttaa 


ct cttaaa 


90001 


aaqcaataat 


ataqaaqaaq 


atqtatataa 


ttqtattatt 


cccatat 


aatatataaa 


90061 


tqtaatatat 


ttqactaaca 


qcacaaaqca 


qqtqaaca a 
acaga 


aacaca eta 


ct a taaa 

gg g 


90121 


aaaatqacac 


caqatqqqaa 


cttqaatcca 








90181 


caacaqqaaq 


qcaaactqtt 


qatatatatq 


tactctcctc 


tcttttttca 


qcttctttaa 


90241 


q taaaatt 


atataaaqca 


ataattacaa 


caattqttaq 




t t t 


90301 


tqtqqtaqqc 


aqaactqtqq 


ccctccaaaq 


atqttcacat 


tctaatcccc 


t t 


90361 


ta aq 


gttatatqta 


aaqqaaaatt 


aacattqtaa 


tc 


tttaaaatat 


90421 


tqaaattatc 


Ctqqattaqc 


caqatqqqcc 


caqtqtaatc 


acaaqqqtcc 


ataKa 

tt a 99999 


90481 




caqaaqaqaq 


aaactqqaqq 

gg gg 


qatqqctqca 

g gg g 


a a 
ggg aggac 


ggee ga 


90541 




taaaa 

g g 


aac^atcct 


ca gagcca 


aggaa gcag 


g ggee cca 


90601 


aaaactoaaa 












90661 


cctqccaaaa 


ccc attta 


aactcaqtqa 


qactcatttc 


aaactctaac 


ttccaaaact 


90721 


qcaaaataat 


accattqcat 


tatttaaqcc 


actaaattt 


t taattt 


ttacc ca c 


90781 


aatcaaaaac 


caacacaaqt 


tqcacaqcaa 


qaacaqtaca 


aaqat 


tgggceggge 


90841 


ycggcygctc 


acqcctqtaa 


tcccaqcact 


t cgggaggcc 


c^ctctact 




90901 




qaqaccatcc 

9 9 


tq^ctaacac 




c 9 actc a 


aaaaata 9 " 


90961 


aaaattaqcc 


ggycgtggta 


ycgggcgcct 


ataatcccaa 






91021 


qqaqaatqqc 


gtgaacccgg 


gaggcggagc 


tCqcaqtqaq 


c tc 


qccact cac 


91081 








ctcaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


91141 


aaaaaaaMaM 


agatgggggtv 


qqaaqaataq 


aqctatataa 


at 
at^catac a 




91201 


acaqqaatta 


agttaatata 


aatctaaa a 
aa c aaaga 


t t ct ataa 


a gca acga 


agggagag 


-91261 


qac t 


aag gacaga 


a agaggac 


ccaaaattct 
ttt^aa t 


gcccc ccac 


aaaaacagga 


91321 


t 




g 




^tttt t 3 


ccagaaacct 


91381 


acaacaacca 


tt 




t 

^^ at ^ C t 9 


aa gg g 


acagtgggt t 


91441 


atggtgtttt 


aatttactt 


cttaccatta 




caac cagca 


gtggccatga 


91501 


caa^actcc 


ccacattcct 


ggtgtaggtt 




acc acaca 


ccaaggagct 


91561 






actcaaaaga 


aagaaa a g 


ga c a a 


ccagagaat 


91621 


gtggttgtgg 


ttttaaccc 


g gg 


ttt^tcaaaa 


tattta 9 C 




91681 




aa t tttc 
cacct 9 c 


ccagacag g 




9 3 R 3339 


^ 99a9 ° *~ 


91741 


taqctqcaqa 


gggg 


acaggacaa 


aKtt 

3 a atattt 


agac cacaa 


aaa ? a „ a c 


91801 


taaaqqctt 


gggaggaaga 


gggcagggaa 




gaggaaa aa 


ggtttttgga 


91861 


aaqctcttqt 


aaa ^ctaa 


9 aa actcaa 


ttctaccact 




gggtgagatg 


91921 


catqctcaqa 






aa ttttcac 






91981 


ctqtacaaqt 


aqqaaqt 
aacac ^ 


t 


a tt taaa 

g 


ca°tct C ct 
a^atttctt 


aa cat" 9 


92041 


ggcgtgctca 




caatctacaa 


agacgaggag 




cctaccct 88 


92101 




aqtctctqtc 






a ctaacaaa 


aca a actt 


92161 


tqqtqqccat 


atataatqaa 


caat°caqac 


tttacaaaaa 


ta ttta aa 

g g 


aaatcactaa 


92221 


acaaacaaqc 


aacaacacat 


aaaaaacaac 




aagtgggggg 


catct attt 


92281 


ccaaaotctc 


cacattcaaa 


acqtcca tt 


tccaacaaaa 




at caaaata 


92341 


acaagaaagg 


atggtccact 


cacaggaaat 


aaaaaagacc 


ttccctgagg 


aagectagae 


92401 


acctcatgtg 


ccagacaaag 


attttaaata 


aattgtctta 


aatgtgttca 


aagaactaaa 


92461 


agaaaccatg 


ggcaaataac 


gtaagtaaag 


caagaacata 


atgtatcaca 


aaatagagaa 


92521 


tatcaataaa 


gaaatagaaa 


ttataaaaag 


gaaccacaaa 


caattttgga 


gctgaaaagc 


92581 


ataataactg 


aaataaaaat 


ttcaatagat 


cagttcaata 


gcagatttga 


gcaggcaaac 


92641 


aaagaatcag 




ttatcagcaa 


tcatccaatc 


tgagaagtag 


aaggaaaaag 


92701 


aatgaggaaa 


aaggaacaga 


acctaagaga 


tcaagtacac 


caaaatacaa 


aaaaatgaga 


92761 


gttcaagaaa 


gagaatagag 


aaacaaaggg 


gcataaagaa 


tgtttgaaaa 


aacaatggct 


92821 


ccaaattccc 


caaatttgat 


aaaaagaaca 


tgattctaca 


catccaagga 


gctcaatgac 


92881 


tccaagtttg 


aaaactcaaa 


gagttacatc 


aagatacatt 


gtaatcaaat 


tgtcaaaaga 


92941 


caaagacaaa 


aagagagtct 


tgaaatccac 


gagagaggag 


cagcttgcca 


catatacaag 
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ggctctcaat 


gagcttaaca 


actgatttgt 


catcagaaac 


aataaaggcc 


agaaaccaac 


93061 


gggatgacat 


atttaaggtg 


c gaaaaaaa 


actg cc g 


cggc aaaaa 




93121 


accatccctc 


aaaaatacaw 


gaaaaatWag 


acactttcag 


agaaacaaaa 


actgagagag 




tttgttttct 


agttgacctg 


ccctccaata 


agtgttaaag 


aaagtccttc 


aggttgaaat 




aaaaggacag 


aagacagtaa 


c caaagcca 


gaaa 


gaagaa ac 


taataaaggg 




aaccccaaaa 


tcttcgtttg 


taactct tct 


cttttttcct 


a a ga a 


gtctatagat 




gcgtaaagca 


ataattataa 


atctacatta 


atagacacac 


a g a gaaa 


tgtaacaata 


93421 


ttataataaa 


gggtggggaa 


gacagagctg 


tataggagca 


aagcattttg 


tatactat tg 


93481 


acactaagtt 


gatatcaatt 


caaacaaaat 


tgttatgtgt 


ttaagataat 


agctgtaatt 


93541 


cccaggataa 


ccaataagac 


catagctttt 


aaatatacag 


caaaataaat 


gaaaacatga 


936 


atcaaaacag 


tacactaaaa 


aacaatgtgt 


taaacacaaa 


agaaggaaa 


at t tgaggaa 


93661 


ttgaggaata 


aagaatatga 


catatacaaa 


acaaatagca 


gatagcagaa 


ttcagtccct 


93721 


ccttatcagt 


aattatttaa 


atacagatgg 


actaaatgct 


gcaattaaaa 


ttcagagaat 


93781 


gggagaatgg 


attaaaaaat 


cccaagatat 


aatccaacta 


tatgcttatc 


cacaggggac 


93841 


caaatttaga 


ttcaaaatac 


acaaatagtt 


tgaaaatgaa 


aagatggata 


aagaaatgct 


93901 


atgcaaatag 


aaaccagaag 


agacctggga 


tggccatact 


aatatcagac 


aagatagact 




tcaagtccaa 


aaattttaca 


acaaagaaca 


gcattatata 


ttgatgaaag 


agtcaatcca 




ccaagaagat 


ataacaatta 


taaacataaa 


cacacctaac 


aacagaatcc 


caaaatctat 


4 1 


gaagcaaaaa 


ctgacagaat 


tgaagatgga 


aatagaccat 


tttacaataa 


tagctgaaga 


Q 4 9ni 


cttcaatatt 


ccactttcaa 


aaatggataa 


aacaaccggc 


acatgatcag 


taagaaaata 


4 


gaggacatga 


acaacactgt 


aaatcaacta 


gacctgacag 


acatacatag 


agcaccccac 




tcaac agcgg 


cagaatgtgt 


attcttctca 


tgtatgatga 


tcattctcca 


ggataggtca 


Q 

4 


tatgt taggt 


cacaaaacaa 


gtctcaataa 


cc ttaaagag 


actgaaacca 


tacaaagcat 




cttctccaac 


cac agtggaa 


taaaactaga 


aatcaatgac 


agaagaaacc 


ctttaaaaat 




tcacaaatat 


gtagaaattt 


aacaatattc 


tct fcaaagaa 


ccattaggtc 


aggtaatatg 




ttacaagata 


aat tag aat a 


tataataaat 


aataaatctt 


gagacaaatg 


aaaataaaaa 


4 


cacaacatac 


ccaaaactta 


aaggacatac 


aaaagcagtg 


cccagaagaa 


tatttatagc 


94621 


ttcaaattcc 


tacattaaaa 


ataataaaga 


tctcaaatca 


ataacctaac 


tttacacatg 


94681 


aaggaaccag 


aagaaaagca 


acagaaactc 


aaagctagca 


gaaaaaagga 


aataatcaag 


94741 


attatagtga 


aggtagacaa 


aatagagaat 


tgaaaaacaa 


taagaagact 


caatgaaacc 




aaaagctggt 


tctttgaaag 


atcaacaaaa 


ctgacaaaca 


tttagccaaa 


ctgaccaaga 




taaaaagaga 


gaaggctcca 


at tactaaaa 


tcacaaacaa 


aagtggggac 


attactaccg 




accttataga 


aataaaaatg 


at tataagag 


aat at tatgt 


acaattgtat 


acgaacaaat 


Q DO 

94981 


taaataacct 


agaagaaatg 


gacaaattcc 


tagcaacaca 


gaaactacca 


aaattgactc 


95041 


aaggagaaat 


ggaaaatttg 


tacagatcta 


taacaagtaa 


agagcttgaa 


tcagtcatca 


95101 


aaaactccca 


ataaagaaaa 


acctaggacc 


agatggct tt 


actgggaatt 


ctacctatca 


95161 


cttaaagaat 


taatatccat 


ctttctatga 


ggccagcatt 


accctgatac 


cgaaaccaga 


95221 


caaaaacatc 


ataagaaaac 


tacagactaa 


tgtctcttat 


aaatataggt 


gcaaaaatcc 


95281 


tcaatgatat 


tctagcaaac 


caaattcagc 


agcatattaa 


aaggcttata 


tacaatcacc 


95341 


aagtgggatt 


tatccgtgga 


atgcaagagt 


ggttcaacac 


acaaaaatca 


atccatataa 


95401 


tacactacat 


tcataaaatg 


gaggaaaata 


aaccacacag 


tcatcttggt 


tgatgcagag 


95461 


aaagcattag 


acagaaccca 


acaccctttg 


tgatcaaaaa 


atactcaaga 


aacaaggact 


95521 


agaagggaaa 


ttcctcagca 


tgataaaggg 


tatttacgaa 


aagccaaaag 


tactcatcat 


95581 


ggtcaactgt 


aagactggaa 


gcctgctctc 


accacctgta 


ttcaacatag 


taccggacat 


95641 


tctagtcaga 


gcaattcagc 


aagaaaaaaa 


taaataaata 


aaaggcatcc 


acattgcaaa 


95701 


ggaagaagta 


acacttcttg 


agggaaaatg 


gggaatgatg 


gctcgtgggt 


acagggtttc 


Q 5 » 61 


ttttgggagc 


aatgaaaatg 


atctaaagtt 


gactgtggtg 


atggttgcac 


agctgagtga 




atatagtaaa 


aat tatggaa 


ttatgcgccc 


gaaacgctga 


actacagcat 


atgtgaatta 




cagctcaata 


aacccaccat 


aaaggaaaga 


ttaaacaatg 


atctactcct 


tacctctgtt 




ctcagagcca 


tttccagccc 


agcccaggcc 


ccaggcaacc 


ctctgtgggt 


gacgcagcat 




tacctgcttt 


ctggctgtcc 


agtgttgtgt 


agcgcaaggc 


cctgggaggc 


cactgggttg 


96061 


gtcatgcggt 


aaccaagaaR 


gggagcaagt 


agt tgggtga 


caaaccctgt 


acatctgctc 


96121 


ctaactttcg 


aaacctggct 


gatgggaggc 


caatgtSacc 


ctctaacgcc 


ctcccatagc 




agcctcggct 


tgtcttctga 


cagtggcact 


gtccacccac 


aggcatcacg 


catcctgccc 


96241 














96301 


atcctgctca 


ctMtagggaa 


ggagaaagac 


ctcttcagct 


tgtcctgcct 


cggcaaaggc 


96361 


aaagcctcat 


gcatttgctc 


actcctccct 


cccagggctc 


agccggcacc 


tgaagctagt 


96421 


gtagcagaaa 


gagcccaatt 


ctgcatcagg 


cgggcctggg 


ttcaaatccc 


agatcctcca 


96481 




gtgccatctc 


tgagccttcc 


atcccttacc 


tagaaacaga 


gttctttcca 


96541 


cccatgccct 


tgtgagacag 


cgtaagtgaa 


aatgacatcc 


acacggtact 


ggaagcagag 


96601 


ctgctagaaa 


cccagccagg 


cagggtttga 


acttgttatc 


tccaaaaagt 


ctcaggaatc 


96661 


gtatttctgc 


ctgatggtga 


cctacttcag 


caaggaccca 


atgtagaaat 


tactctgcca 


96721 


ccaaaaaagc 


aggcctcaga 


ccccctccat 


cacctcccYa 


atttatgaat 


taaacaacga 


96781 


ggccagtgcc 


tgaggggtct 


gcagatgaga 


gtcagaaacc 


agcagccccc 


caccccacag 
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agacagg cc 


c gc aac a 


gacacaagaa 


ggee c 


gggg g c c 


g ^ g g c 




g 9 8 99 9 


a ^ c a 9 cc ^c 


ggece ccag 


gagggacac 


acg gcagga 


ca ggaa 




aaca 9 c 9 


9 9 cc c 


aaaaag gaa 


acac g g 


a ca g gag 


ccagcaattc 




cc a gca 


a acc aaga 


g aa gaaca 


tatatgtaca 


cacaaagact 


tgtccacaaa 


Q70R1 


ggcca cga 


ggagaagc g 


gecacagaga 


agatggecca 


gagaggaggt 


gctgtgtgca 




gaagcagc c 


gggaggaaa 


gacg gcaca 


ggecaaatte 


agattactga 


cacttttttt 




aa gcaga 




agaattacct 


atttttgact 


tatcttgaaa 


aactgaggat 




c ggcaacac 


tgaccccgaa 


tcttcctaac 


agcaacaatg 


atcagaactc 


agaggttttS 






attttttttt 


ctttttcttt 


tctttttttt 


ttttttttga 


gatagggtct 




gac g ca 




ag gcag gg 




gc cac gca 


acc c acct 




cccaggc ca 


agcaattctc 


^ gcC Cagc 




gctggttcca 


caggtgcatg 




ccaccacg g 


age aa 


aga fctt t 


ttggtagaga 


tggggtttca 


ccattttgcc 




cgggc gg c 


cgaacccc 


gagctcagg 


ga cca ccg 


cctcagactc 


ecaaaatget 


9 7 621 


gggattatag 


9ca ^ ^'Zf. 


?f a f ^ 9gC ? 


ttgtctcact 


gatttctgat 


c ^ tattttgt 




attatctttt 


^ c ^ a 


/~ gga 


caatgttgcc 


fct ^ tctaac 


t ctcacact 




ggaaatttat 


ctctttaatt 


tttagacctt 




aatgtaaatt 


tccatccacg 


Q7«m 


ttttgatatt 


tagtgctttt 


attatcattg 


agttgtattt 


ttgaagttcc 


actattattt 


Q7RS1 


cttctttgac 


ccatgaatta 


tttaaaagta 


ttttttttta 


atttccaaat 


aatggggatt 




tttgctttgg 


gggctattaa 


tttctctctc 


acaggcatga 


tggtcaggga 


acaagacctg 


97981 


ctgacactat 


tcttttagtt 


tctgtgagat 


ttgcacttaa 


ttgaccagtt 


gttacaagtg 




tttcctatgt 


gcttgagaaa 


aagaaataac 


ctctacttgt 


cagatgeaaa 


attcatcaat 




gtcaagctta 


ttatttgtgt 


tatttagaac 


ttccttatct 


gtgctgggtt 


ttttgtctac 




ttgatccaac 


accaattgga 


aaaaaatggt 


tggattctct 


caaaaatgat 


gataaatttg 




tcaatttctc 


tatttctaac 


aaagttcacg 


tcaaatatgt 


tgaggttatt 


ttattaagtg 


Q R9R 1 


actctacctt 


tagaactget 


ttatcatctt 


ggggaaatga 


accttttctc 


attaaaeggt 




gacctgtctt 


tccccagcag 


cgtctcctgt 


c 1 1. g a g g t: c t 


gttgagtctg 


agattgatac 




agcaatgtcg 


gettgecttt 


ggcggcctga 


taggtctgtt 


tccatacttt 


actttccact 




ttttcttgcc 




act ctggtga 


tagegtccag 


ccagattttg 


aatttggggt 




ttgtctcgtt 




qacagcctct 


gtctttgaat 


ttgtgagttt 


aatccattta 






ga ^ a jj tga 'j 


aaatctgggc 


ttgtttctcc 


catcttactc 


tatgatctca 




a g c g 


gt tctgt 


ccttttttct 




ttgtcttata 


aaaaaagaca 


98701 


cccggccgg 


gcScagtggc 


teaggectat 




ctttgggagg 


ccaaggtggg 




cggatccctt 


gaggecagga 


gttcgagacc 


agcctgacca 


acagggtgac 


accccatctc 




tacaaaaaat 


acaaaaaaat 


teggggggtg 


tggtggcaca 


tgcctgtaat 


cccagctgct 




agggaggctg 


aggcaggaga 


atcactggaa 


cccgagaggc 


agaggctgea 


gtgagecgag 


98941 


atcgagccac 




ft ggg aaC 


aaag gagac 


cca c caa 




99001 


ggca ccc 


ttt^t ?° a ? 
c c aa 


g cc g c 


accac g gc 


ggc gcccc 


agecaggctg 


99061 


caaccc ggg 


gacacacatg 


cccctccagc 


agtgagttca 


gacccttctc 


accctttggc 




c caggc cc 


cac cc g 


ccccctctcc 


tgagaggtga 


cacgccatgt 


ccacaccaac 




tccaaatctg 


ateategget 


ccagtccagt 


ccctgcacca 


getgetcatg 


ggcctttctc 




tctggggtgg 


ggctgggagc 


tgtggttcct 


gctgctgtgt 


cctgcactac 


tccacatttc 




ccagttcaga 


ggtgatcgga 


gccccctcta 


atatatgeca 


tcccagggcg 


gggcagcagg 




gcacaaggcc 


acttcagtgt 


caacaeggat 


ctcactgcag 


gaactaggag 


gggcctccct 




taggaagagg 


cactctggac 


tgttttaacc 


acag cccca 


cttctccctc 


ctgtctcaca 


99481 


g cccgagc 


ca acacac 


aege gc gc 


ctattctgga 


attatgeagt 


atta attt 


99541 


a a ca g 


cgc gggca 


tatgatgetc 




gggece gga 


gC i- 393 


99601 


caa cc ag 


aaggac cca 


gccagacccc 


aaaqgecaca 


tatta caat 


ggg c a 


99661 


gaag a ac 


agaa aggca 


aa ccaaaga 


cacagggag 


agattaKtgg 


tcactatgag 


99721 


ctgggaaggg 


gaa ggggag 


gac gc c 


gggtgcaggg 


cccactgg 


gg ga gaaa 


99781 




aC aga gg . 


ga ga ggc 


gcacaacact 


gccaaaaacc 


ac gga ac 


99841 


acac ca 


ac ca a g 


a a ag cc 


a ga 


aaaagg cag 


aggaagcaac 


99901 














99961 


ttaagcccac 


aagaatattc 


tctaagctac 


caggtcacac 


acacactcYc 


tctcactgtS 


100021 


cctgaagccc 


atagccccct 


ctgcctgact 


ctcctatcaa 


ctgtKgatgg 


cgtcaaagat 


100081 


gcacaSaaat 


atattttggc 


tgaaagaaac 


acaacagggc 


acaatgagga 


agggctgeca 


100141 


agcagcctat 


tagcagcatt 


tgtcatggtt 


catctgggag 


tetgectttt 


cccccagcca 


100201 


gactcttcag 


ggacagagac 


cagattgttc 


atactcagtg 


cccagcaggg 


cgctggcctg 


100261 


gggtgcttgc 


agacactgac 


tggatgaatg 


gatggatgga 


ttgggtggat 


gaatgagtgt 


100321 


gtagatgaat 


ggatgatgaa 


tgatggatgg 


atggatggat 


gatggatgaa 


tgaatggata 


100381 


aataattgga 


tggatggatg 


ggtagatggg 


ggatgggtag 


atggatgagt 


aagtagatga 


100441 


atggagatca 


atgatggatg 


gatagatgga 


tggatgaatg 


agtgggtaga 


cgaatgaata 


100501 


cacgatggat 


gaatgaatgg 


ataaatgatt 


ggatagatga 


atggatggac 


ggatgacaga 


100561 


tggataaatg 


gatgaatgat 


tggatagatg 


gatgaatggg 


tagacagaaa 


gactctgata 


100621 


ctggaaagtg 


aatccttcac 


atccatggca 


agtgaaccgg 


gtaacaaagt 


cagaggcaag 
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100681 
100741 
100801 
100861 
100921 
100981 
101041 
101101 
101161 
101221 
101281 
101341 
101401 
101461 
101521 
101581 
101641 
101701 
101761 
101821 
101881 
101941 
102001 
102061 
102121 
102181 
102241 
102301 
102361 
102421 
102481 
102541 
102601 
102661 
102721 
102781 
102841 
102901 
102961 
103021 
103081 
103141 
103201 
103261 
103321 
103381 
103441 
103501 
103561 
103621 
103681 
103741 
103801 
103861 
103921 
103981 
104041 
104101 
104161 
104221 
104281 
104341 
104401 
104461 



accatagcag 
catgcttgac 
ggctcagggc 
agcctcatcc 
caatcttggg 
agaatggggt 
gcccgcctca 
gtggcacggc 
atcctcgggg 
acacaattcg 
aactggctgc 
catgagcatg 
ggccttggtc 
gagggtcttg 
gcctttcgca 
aagttgtcat 

ctgaggccaa 
agctcaggga 
atgtggatgt 
agaggagaga 
ctcacactcc 
atcggcaggg 
agtgctgctg 
gattttcatg 
acgagtcaaa 
aggcctggct 
agagcaagga 
ggggtcacca 
tccccggggt 
tcctacaggc 

cYaKaacaaa 
ccatcatcgg 
taggtgccgg 
gccgggggtc 
cctgctccct 
ccccgctgct 
acaggacgac 
attcccgaaa 
ccacatccaa 
ggccgcctcc 



ttccagcttc 
gggcccagtg 
ccactgtcct 
tctcatcctg 
ccctctgcac 
gacctgagca 
caacccagac 
cccaccaggt 
tcgcctggcc 
agcctggcca 
tgaggaggct 
aaggctcaag 
tgtccagaat 
gcccggatcc 
gacaggggtg 
cacggtgatt 
cctgtggact 
atccggccgc 
acaggacagc 
agaccatgcc 
tcgtgatctg 



cgtcatttgt 
cacaggtcat 
tagggctgcc 
ccatcttccc 
gtcccctcac 
cagcaaagcc 
cctccattca 
acagacactg 
atcaggggct 
ggcactgtga 
tccccactgg 
ggcaggtacc 
caccgtggac 

cagggcccag 
cttgtgaccc 

ggcagcagga 
agctcctggg 
gtctgctctg 
gagacggaga 
actgcaaacc 
agggaggggt 
gcaggaacac 
tgaaacctcc 
taaaacattg 
ttagtccagc 
ctccgcaggc 
cagcccatgg 
ctctgtcacc 
ctgtgagtcc 
gtcctcgtgg 
ggagcacagg 
ccgtttgctc 
gagcccgggg 
tgtccgccca 
cagtgccgca 
ggctcccgct 
cgcaggtttc 
ggaaaataaa 
gaagctcggt 
tgacccccat 
tgaattagct 
tcttataaaa 
atctctgctg 
cagcgaacgc 
ggggctctgg 
acgctggggc 
ggccatcatg 
aatgtgtact 
gctcctgctg 
agacttgcag 
ccccagcctc 
gaaggcagcc 
cctgggcagc 
gatctgggaa 
cacagaggaa 
gagctgtcac 
tggaatggga 
gcctaccagg 
tgctctcatt 
agcctcctgc 
acctgctgca 
acctccagtc 



ctctagaggg 
gatgtggcct 
tctctgcctc 
tttctgtccc 
ctgaggtaga 
aaggacacca 
atggcacagg 
ctgcagcagc 
ccctactgga 
ctgctctggt 
caccagtctc 
agcgtctctg 
agagcagagg 
cagccagctc 
gcccctgggg 
gtgtcaggcc 

tgcggctcgg 
accatgggtg 
gagtttatgt 
aggcaggcgg 
tgtgttcgcc 
gcccagggtg 
tgatttttaa 
cggggtgggg 
accccgatgt 
caggcccaca 
ggagcaccaa 
aagggcccac 



Lgalgaaggc 
cacctgacgt 
tccaggctgc 
aacagaacct 
gcccggggag 
tccaacagct 
gatgcgctac 
gcctcacaga 
aattcctctg 
cctgagcgcc 
cccggccact 
gccaacagtt 
ttggaggccc 
gcgtcacctg 

ccttcctggg 
tgggactgcc 



gaactcctcg 
tggtttgggc 
agccatctca 
ggccccagct 
cccatggcac 
attgctcgca 
aggtccttga 
atgattctag 
aggcagccgg 
atacggttct 



cggctccctg 
tggcacagcc 
ccctcaggcc 



tgaggattgg 
ttgtgcctgt 
caccagtgag 
atctcccaag 
gccgtctccc 
tgtggaaaac 
ggccacttct 
aggcagggct 
cccctttgca 
aggccctggg 
YttctStgtc 
acacggtcYt 
ccccaggctc 
agcccacaga 
acacgggtgc 
tgccatggag 
gacaggacag 

tgtgcatgcc 
acgtgtaact 
ggccagcatg 
catccagagg 
gccaggcctt 
gtgtacactc 
cggcgaggcc 
acatacagac 
ctggcggccc 
gSacaccaag 
attgtgagtt 
tcccagttca 

tctggacaca 
gggggcctgg 
cagacccgga 
tccaggggct 
ctgtctgcct 
agatgggctt 
aacggaaccg 
aagtacattt 
gtgggctcca 
gccccgggct 

ccaggccccc 
ggctccaggc 
gccagtgtta 
catggctgtg 
ccagcctgac 
tcaacttgac 
agcacctagt 
gccctttgtt 
cctgccacgt 
accccacgga 
gctaggaatc 
gaaaagggcc 
cgagcgccac 
gtggggctgt 
gccaggcaag 
cccttcctgc 
cctgatccag 
cctcttccgt 
cctcctccac 
ctcaccctgc 



atttttgggg 
gggtcaggaa 
gagcagactg 
ttcagatcct 
acctggctgg 
aagattatgg 
caatgtgccc 
tgctccttcc 
gggcagccct 
cagatgctgS 
gtctgcctac 
cccagcctct 



tcaggtgg 

aacaggaa 
ttgggcct 
accctgggct 
catttcctac 
tgcgtgtgtg 
accaaccaga 
agtggggggg 
aggcacagag 
ctgaaggaac 
aacatttctg 
gcccaagagc 
gagaagactg 
cggctgcctg 
aaagggaggc 
tcaaaggtgg 
aggcctcggg 
gggagctccc 
agggcagaac 
aggacatcca 
actgggcacc 
gctccgaacc 
gggctggggc 
catcgctgcc 
ctggaaaggc 
caactccgag 
gtgaaggtcc 
gctcaaccct 
agattgtaag 
ggccgacccc 
atcagtgtgc 
aaggttgaga 
tccagctctg 
ggctctgatt 

ccctgagctc 
ccttttcgcc 
ctcctgactc 
gaattgtcaa 
gtccctgcag 
tgcagctgga 
agacacaggg 
ctcgacaggt 
gatctgggtc 
gagtatcccc 
acgatggcgc 
gccgcagtct 
gctcaccctc 
gacggagccc 
ccctcctgcc 



ggcctgactc 
cctggctggg 
ccaagcaggg 
tggtgtcttc 
ggagtgtgtg 
tgggcMtagg 
acccccaaca 
cccgcagctt 
gggagaccgg 
gagcacaggt 
gccaggtccc 
agaggacggg 
gaggcctcag 
cctggtgccc 
atggatgaaR 
tgagagagag 
gtccaggagt 
ccaggcacga 
cctcccaccc 
tgaaaattga 
aagccgttgt 
tgcagggctg 
aggtcaggcc 
gcccatcact 
ccaggcctgg 
ccgaggtggt 
acaaatccaa 
gggatattgc 

ggcacgggta 
agatgcacca 
cggcgccacg 
cctgcRccct 
tccaagaaca 
ctctcttgac 
gcagccctgg 
ccagcactga 
tacacggctt 
taaaatcaat 
gtgtctacct 
ccatacgaga 
ggacagatgc 
ccctccagag 
tgggtgggca 
cgccccccgc 
ctccccgcag 
ctgctggacg 

aggcagagca 
tcgtccaaaa 
ctcaaatatg 
accttcccag 
ctgaggtcca 
ctcccagcct 
ctctgaccaa 
tgaggggctg 
ctccctgcac 
ctcggaaacc 
acagaagcca 
acacccccaa 



127 



PATENT 

Docket 524593008900 



104521 
104581 
104641 
104701 
104761 
104821 
104881 
104941 
105001 
105061 
105121 
105181 
105241 
105301 
105361 
105421 
105481 
105541 
105601 
105661 
105721 
105781 
105841 
105901 
105961 
106021 
106081 
106141 
106201 
106261 
106321 
106381 
106441 
106501 
106561 
106621 
106681 
106741 
106801 
106861 
106921 
106981 
107041 
107101 
107161 
107221 
107281 
107341 
107401 
107461 
107521 
107581 
107641 
107701 
107761 
107821 
107881 
107941 
108001 
108061 
108121 
108181 
108241 
108301 



acaagccccg 
tggggaactg 
ctgtggcact 
ggctgtatga 
tgctgtgcgt 
ccctgggatt 
gcacgctggg 
cattgctgcc 
cgggtctgtc 
gacattccca 
aaggaaggga 
gtcctcaacc 
cagaagaaag 
aaacgcacag 
ttaaagtggg 
agtgtgagga 
gaccagtagc 
aggaggcgtc 
agctcctggg 
acgcaccaac 
attctccccg 
ggccccagtt 
gtccaacaaa 
gacaggaggg 
tgatgcagga 
gtcggcacct 
gggctcatta 
acggtaagtg 
ttggacagtg 
taaatgacac 
tgaacgcaaa 
aacagacttt 
gggaaaagct 



tcctcacccc 



agagtaatca 
agcgcaaaaa 
cgtgctggtg 
cgagtggtga 
cccctgatgg 
tgccgggtca 
gtccccaccc 
cgggcagctt 
ggaaggctct 
gtacgtgcag 
cagtggcagg 
ccatgggacc 
cctcccggag 
ttgggggctc 
ttctttcctg 
agctgtggag 
aacctgccac 
agcctcaggc 
tgattccttc 
gcctccaccg 
ctccaaaggc 
aaagcgatga 
cctcagttcc 
ctcagggctc 
tcccatcatc 
caccccgttt 
ctcccctgcg 
agccccagca 
aatgcaggaa 
cctaagcagt 



tcccaagtct 
ggtcagggct 
gccgggtaaa 
ccacaggtgc 
agcagtgctc 
ctcaggcacc 
gttccagcat 
aataaaggac 
gcccgtgtct 
tcggggcctg 
agtgatgaac 
tcccttgttc 
catagggccc 
agtgggacta 
cacccacagc 
agtgccaagc 
cgaccgggct 
cgtgtgccag 
aagctaacgt 
tctgcagccg 
cagatggggt 
cactgcactg 
gtcaaaaggc 
cagtttcctc 
caagaatcct 
ctccaggaac 
gtagccctgc 
catgaagagt 
cttggctcca 
tgtgcaacta 
tgtatcgaac 
taagacccca 
aacccaggga 
gcaacgccgc 
aggatttgga 
gtctgttaag 
ctgctcaggg 
gcaacagata 

ttgtgtgggg 
gcctgcaaag 
gccacagttc 
caaggggaag 



caagaggagg 
ctgcccccgc 
ggaaaacacc 
tcagagtgat 
ccccaccagc 
caccagccag 
cttcacacca 
ccatctgtca 
ctccaccgca 
gtgccaccca 
aggtactagc 
ctgcatgtgc 
accaacctgg 
tactttgttc 
gggctgtgag 
cccagtcctg 
aatacacttg 
agagctgcta 



tcaggtactt 
tccaggcctc 
ttcattgtgc 
gtgcctgtct 
gatctgttgg 
tttatgtcca 
cagcccgacg 
aaacgggcct 
ggacagggcc 
agacagttaa 
actgggcacc 
gaccacagct 
agcactggcc 
aaaaagtgat 
tgccggatgg 
gcaaaggccc 

cctccaatgc 
tcctaggcac 
gcattcctca 
gctaagtgac 
ttaagtggag 
cacaggctac 
ggtgtcagca 

atctgtaacg 
aggagatgat 
gcgtcacgtt 

gcccaccttc 
acggggcagg 

atgacagaaa 
ggagataaac 
agatcactca 
tttgctgccg 
gaaaccagga 
cgctgggatt 
ccagggctcg 
cctatttgca 
agagagtttg 
aacagctggt 
gcctagaggc 
ctgcatccat 
ctggaccggt 

tggcagtgca 
cccgtgagtc 
tccctctggg 
gggtgccaat 
acacctgccc 
cctcccagcc 
cgtcccggaa 
ccgggactcc 
cagggaatga 
tccctcccgg 
tcttccttct 
tgttccctct 
ggcgcctccc 
acaggattct 
cccagagaac 
tcagacacat 
tggcagcagg 
atgctattat 



cgcatctgct 
agcccaacgt 
aggtatttgt 
ggtcccagag 
atgaatcaag 
gagacaaccg 
gcatccccag 
gtggcagtca 
atcttcctcg 
ctgtcagaaa 
cccaattttg 
ccatgagacc 
acttgatttg 
tacgacttct 
gtcacaagcc 
agggccaccc 
agtctgcagc 
ctgcagggag 
tatgcttatt 
aaatgctgca 
ctggagacac 
ttccaggcct 
cgggttttgc 
gccctgggtt 
ctgtttctac 
tgggcacagc 
gtacctgcag 
cttcttgctc 



aggcaaagga 
tcctcgccag 
gagcagcaaa 
gctgctgtgg 
tcctaattca 
agcgcgtctg 
ggaacagaac 
caccaggagc 
agcgctgcct 
agccacacac 
tcatcccggc 
cctgctagac 
gacagcccgc 
tcccagggcc 
tgggaacaca 
ccagggcagt 
tctgactcct 
ggccaccccc 
acgtgacctc 
gctccacctc 
tccctgcttc 
attctcaccc 
ccctgcacgc 
gccccttggt 
gcctggaaca 
tcccttacct 
ccactgtctg 
agggactcga 
ggtggacaca 
caaggtgccc 
tcccccatga 



gccctggccc 
ccagtgactg 
ggactgacta 
cctggcccag 
ggagagagga 
cgcagggaac 
ggcagaggct 
caggggcaca 
ccatctgcag 
gtgctggtgc 
gccactgtgc 
acagggagga 
cagggcacag 
aaggcaggaa 
ggcgaagtca 

cctgacactg 
caactgttgc 
ctttcttttc 
aattccactg 
atcctcactc 
tggtgcttgg 
cagcccaggc 
ctaagcctgc 
agcttactat 

cgtgctggca 
cctgctgttc 
tccaggacta 
tcggtgaatg 
ttcagactca 
gaaagaggaa 
aaaaagagat 
cttaggaata 
taaccaaaga 
atattaaaca 
gagccagtct 
tttagtcact 
ggctggctca 
ctgaggtcag 
agggagcatt 
gcggcccctt 
cctgccctgc 
cagtctgggg 

cccacagagc 
tggctgcctg 
tgggtcttgg 
gcagagcagt 
gcacgtggcc 
accagcacac 
tcaggggcag 
aaactcatcc 
cacgccgtcc 
cgtaccgtga 
ctcccactcc 
cacgctgcgg 
cccttcccag 
gccttctcac 
cgagcatctg 
tcgggttcct 
tagcaagaac 
tctctgagcc 
tttggggagc 



cttttcctcc 
cgggctcact 
gtgcactgtg 
tgcccggtcc 
aggggagcag 
agtggccaag 
gggattcacg 
ttcctgatac 
tctggcatca 
ttgggcctgg 
agtgccctgt 
agcagagtca 
tgcaaaatgg 
tgacagagtg 

gggcccatgg 
tggcaacaac 
tcccggagtg 
acggatattt 
taagcaaaag 
accctcgctg 
aaggccctct 
tctgagcggg 
tcctccctcc 
ctgtgcaacc 
acccaccgtg 
gacgcggtca 
ccgctactgc 
ccgtgcctat 



ggc 



itgtc 



ctacggcctg 
aaagggctta 
ggagacatgc 
agcgcaaatt 
ctaaagcccc 
ggctctgccc 
cagtaggtgc 
ggccgccatc 
ctcccaggat 
ccagctgtct 
ggggtggaca 
acactcctaa 
tccctcccca 

acagcctctg 
tggggctgca 
ccttggcacc 
ggaagcgggc 
cagaccacct 
ctgcccggcc 
gatgtgtcct 
tgaacccact 
accgaaaacc 
ccgtcccttc 
cccattacgt 
taactccagc 
ggctgggctt 

ctcaccaggt 
ccgctgctac 
gtgacgagtg 
tcagtctccc 
agtattattc 
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108361 
108421 
108481 
108541 
108601 
108661 
108721 
108781 
108841 
108901 
108961 
109021 
109081 
109141 
109201 
109261 
109321 
109381 
109441 
109501 
109561 
109621 
109681 
109741 
109801 
109861 
109921 
109981 
110041 
110101 
110161 
110221 
110281 
110341 
110401 
110461 
110521 
110581 
110641 
110701 
110761 
110821 
110881 
110941 
111001 
111061 
111121 
111181 
111241 
111301 
111361 
111421 
111481 
111541 
111601 
111661 
111721 
111781 
111841 
111901 
111961 
112021 
112081 
112141 



atgatttggg 
ggggagcagc 
agcagcatta 
gcattattcc 
tattccccca 
ccatgatttg 
attaatgaac 
gccacctctc 
tctgggtgtg 
gggttctcta 
gggaccaagt 
cgtgtgtatg 
agagcaggag 
gctcagagct 
cgggtcaaag 
tgtctgccaa 
ctggcacaaa 
tcctcgctgc 
gcagccctga 
agaggcctct 
tgcccatgtc 
gggtcccaag 
catccaaggc 
gaagcctgcc 
ctgatcaaac 
gtctgcatgc 
ccaacacaga 
gaggcagagg 
ctgtgccctg 
cagcacccag 
ccagaacaac 
gctgggcgga 
catgccgggc 
tgagcacact 
ctgcaccaaa 
gagcagccct 
ctcgagttgg 
cacctgccca 
gctccacccc 
ctgagactca 
ctgtcaggcg 
cactgtggaa 
cagcccaggg 
gagccagcca 
cagcctcgga 
cgtgggccac 
aggacaggag 
agccgcctcg 
gctcagctcc 
ggcgctgaac 
gagagagagc 
gggaagtgcc 
gggctcaccc 
caacacttag 
aactcgtgag 
tcttgagatg 
tccttctaag 
gactgcagca 
ccaggaatgg 
ttagccccta 
gctgttttaa 
gacgtcgaca 
cccaacaaag 



tggggagcag 
gagcagcatt 
attattcccc 
ttcccccatg 
cccatgattt 
tgatttgggg 
gggagcagca 
ccgcatggca 
cacacaggcc 
ccgacagttt 
ccagctgtgc 
caagagctct 
cacaggcaca 
agagagaaga 
ccttcagggc 

cctgcagaga 
ctggcagagt 
tgctggggaa 
tgctggtacc 
gtctagggac 
tctttcaccg 
caggtcccac 
acagatgtca 
tctgccatgc 
gccagatgcc 
cagggactgt 
tccaagggaa 
agggaaaggg 
cgtaattcca 
gttctgagat 
agcatgtcgg 
gctgagcgcg 
ggggctgagc 
gacccctgca 
ctgcccgctg 
aatgcagctc 
ggctgcagcc 
cggtgctcca 
tgaaccagaa 
gggactaaat 
gagtgccagg 
taaggggggc 
ggtgggaagc 
ggcccgggag 
gaatgtgcag 
atcaccgcgc 
cgccgcggcg 
ggtttcacaa 
agggtggaaa 
tcagtgagtc 
acctatcacc 
cggttggaga 
tgcaaatcca 
acatgtcagc 
cattacctta 
gggaaagtaa 
agggagagct 
atgttgccac 
attctgccct 



gccattaagg 
gctacagggc 
cacacggccc 



tattattccc 
attcccccat 
catgatttgg 
atttggggag 
ggggagcagc 
agcagtatta 
ttattccccc 
ttcaggtgcc 
cagtgcccag 
gggatgggag 
tcgtggcgca 
gagagacatc 
cgaaagaagg 
cagacgccag 
ccagagcaca 
atttttatag 
attctgagag 
ttggacctcg 
gagcttcctg 
caacgcctct 
ccactatggg 
tcatccatgg 
tgctccttgc 
cagacatgaa 
tggggacctg 
cccgtgagca 
taccttgtgc 
ggaaagaaag 
gacggagagc 
gcagcccctg 
cctgaggggc 
gtggagctga 
ctgacccctg 

ccaacagcag 
cccaccccac 
ctcagaacca 
aagccagggc 
ccaaagatcc 
accatgtcgg 
ttatgtcgga 
cccagacgcc 
ccagccctcg 
acccacaggg 
ctggcccctc 
cctcggcttt 
tggcatctcc 
tctgccaagg 
tcagtgcttc 
gagaggatga 
ggcaacagtg 
acgccagagg 
cacagagaca 
gccaaagccc 
tgaatcatga 
tatggcagaa 
cctggatggt 
ggctatagaa 
aagccaagga 
ggggcctccc 
tcagagttct 
tgtgggtaac 
agaaaggctc 



ccatgatttg 
gatttgggga 
ggagcagcat 
cagcattatt 

ttcccccatg 
atgatttggg 
cggccttttc 
gtcacctcct 
gcgtgggagg 
gcccacacct 
ccagccgaag 
tgagacagaa 
cacgcccatc 
gcgctgccta 
aaaatatttc 
gagatggggc 
gacccaaatg 
tccccttcag 
cacctgtcgc 
atgcagtccc 
ggcagccagg 
ccaaacttca 
aagatgccct 
aaaagagggg 
gatcacagct 
aaaacccgaa 

aagtgtcgca 
tgccactgca 
tgggtgagct 
gcacactgac 
caccagcgct 
ctgcaccaac 
catgccgggt 
tcacccacct 
ctggccggtc 
caaggccacc 
aggcccttgc 
gaggggagag 
ggcacagggg 
gtggaggggg 
ctggggctgc 
cagcgctctg 
ctgctctccc 
cccagcgccg 

gtgcgctcag 
tccctgggtt 
gctgagagga 
gcttggcagg 
ccctggcctc 
ggtctgggca 
tctctccatt 
cccccggacg 
gagactttgc 
cccagcgggc 
gaggagaagc 
acgccaacag 
gaaggaacca 
gagctcccga 
ctggtacagc 
tctgacctgg 
tcctgggccc 



gggagcagca 
gcagcattat 
tattccccca 
cccccatgat 
catgatttgg 
atttggggag 
gagcagcatt 
ccaaccagac 
gtcctgtcct 
cgtgagctgc 
cacggcaagc 

ggagagaaaa 
tgactgcgcg 
ggattaattt 
aggcttggag 
ctgtggtcag 
cagtttctga 
atgtccctac 
cccacccctc 
tttgtcccca 
gcatgaagga 
ccccagccag 
ggacacatgc 
gcatgcccag 
tcacccctga 
aagctctttg 
ctgtgtctgt 
gtgagcactg 
cccaggtaca 
gctctgggcc 
ccctgcacca 
gacccctgca 
agcagcatgc 
ggggctgagc 
agtgctggaa 
ttccagaact 
atcctccggg 
cagccacatt 
gtggcggggg 
agttgctgcg 
agcacaccct 
aagcctcgct 
agcagcagac 
cggctgaccc 
gcccccgccc 
ccaagggccc 
ccctgaagca 
tcccggggcc 
ctccagggat 
gcccaggctg 
aggggtctgg 
caagacgctg 
ctcccttagg 
tccatgtcct 
aaatgtgttt 
cctaaatgtt 
cacgtgacca 
cccccaggag 
gccctgccag 
actgtgagga 
agccacagga 
cattcagcaa 
cccagccccc 



ttattccccc 
tccatgattt 
tgatttgggg 
ttggggagca 
ggagcagcat 
cagcattatt 
attattgagc 
cagcgcacca 
gcaggctctc 
gctcctggcc 
cccagccaaa 
ttgtcccaaa 
atgaagccag 

caaagagaaa 
ctggctgaag 
aggcctccgg 
gccaggcctg 
agctgcctct 
tctatcccca 
ttatgcaggg 
agcctcacct 
caggctccta 
ctctctccca 
cccacagttg 
gccgtgtact 
gaggacattc 
ggaagatggc 
ggcaggcgcc 
gtggagatct 
ccttagtgag 
acagcagcat 
ccaacagcag 
cgggtggagc 
acactgaacc 
gcatctgtgg 
gtcccctgca 
tcaggcttgt 
tacctggtcg 
aagggcaagc 
cacgtggggg 
ggcacgtctc 
tctgccttca 
tccagccaga 
caggcaactc 
accccttccg 
ggtctgcacg 
cgcctcccgg 
tgggctgcag 
ccagcgccga 
accagggtat 
ggagtcactt 
gccaggtcca 
acccaaggtt 
aatcccccaa 
aaatcagggg 
atcacaagtg 
gggaggcaga 
ctgggagggg 
caccttgatc 
atgcaggtgt 
agctgagatg 
attggcaccc 
acctgggcca 
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112201 
112261 
112321 
112381 
112441 
112501 
112561 
112621 
112681 
112741 
112801 
112861 
112921 
112981 
113041 
113101 
113161 
113221 
113281 
113341 
113401 
113461 
113521 
113581 
113641 
113701 
113761 
113821 
113881 
113941 
114001 
114061 
114121 
114181 
114241 
114301 
114361 
114421 
114481 
114541 
114601 
114661 
114721 
114781 
114841 
114901 
114961 
115021 
115081 
115141 
115201 
115261 
115321 
115381 
115441 
115501 
115561 
115621 
115681 
115741 
115801 
115861 
115921 
115981 



agcctcacat 
tcgcctgctc 
ttgctccagc 



cttccatgga 
ggaactgtca 
ctgcggcctg 
caggaggggc 
acgctcagcc 
ccttccttgg 
gggaggatgc 
ggaggcagcc 
gtgtgtgcac 

cttctacctc 
ctatatagtg 
ccagggctgg 
tttgctggct 
ccaaccaggc 
gataggggct 
ctcactgagc 
cagaaatttt 
attttcttga 
gagttgagga 
ggccagggct 
gtattcttgg 
gtgcaattat 
aggctctgag 
ccacatgctc 
caaggtcagt 
tgtgcacgtg 
ctgccaggct 
gggtccagtg 
gccctgggga 
cctcccgggt 
gtagctggct 
gtaccttcta 
gaagcctgtc 
ggaagcacag 
atgatgcagc 
cagattctcc 
agtgagattc 
cgttttcaca 
ttattggact 
gcacgtctca 



acccccatga 
gctacaagat 
tcaagctccg 
agagaggaca 
agagctggcc 
gggacacctc 
ctggccaggt 
cagggaggag 
ctctcccggg 
atgctttaaa 
atcttggatg 
atcaccctga 
ccctgacgct 
cagggcaggt 
tccagacgtt 
ctgccagggc 
ctatacacac 
aggctgtgcc 



ccgctgtgtc 
cccagtcctg 
ctcccccacc 
ttgtgagact 
ggctgcccca 
tgcctgaggt 
gaaaactcct 
tcttcctggc 
acatggagga 
tgcagcgagg 
tgcagcctga 
aaggtcactg 
agcagcctgg 
aggccggggc 
tgcatcacag 
agcacgggag 
aggcagagag 
gagccctgga 
acttcctgag 
gtctctcgta 
tcagcgccgt 
aagccacatt 
taggatgttc 
gggtctgtcc 
ggacaccacg 
tctccacgtt 
ctttttttcc 
tttgcagctc 

ctctgggaag 
gtcttggagt 
gagactggca 
agggaggcag 
ggcgtaagtg 
gtgctgtgca 
gcgtggtgcc 
tgggaagaaa 
ctggattact 
ggacacgtca 
cacaagccaa 
cctggagcct 
atttcagact 
atgctgataa 
tacagttcca 
catggcagca 
agatctcgtg 
ttcaatcatc 
gagattcagg 
tgaggactcc 
atgccacagg 
ttgggaaggg 
aaccactgct 
tcccaggcca 
agaaaaggac 
tgttgcctaa 
accgtcattt 
acagagacta 
gcagcccgtg 
gcagggatat 
gtgtcacctt 
gcttttgggg 
cacactgata 
acacacatac 
cctcactatg 



agtggggctg 
tggtctgaca 
agcctccctc 
ggagggcgcc 
cagctctctg 
ccgcccccac 
acccaactca 
ctggtgggga 
gctgcccagt 
ctgagaggct 
cagcagcagg 
gcacgttgct 
tagcagttcc 
gagccccagg 
aaagcatggg 
gggaccccag 
ctgatgtgga 
cctggccagc 
tgactcccca 
cccgggtcgg 
ggctggattc 
ttaacatcag 
agctgcacca 
ctccttgcag 
tgtggctgca 
tccatcaaaa 
tccactgctg 
ttgtctttct 
tcggggtctt 



gaacagagct 
acactgcaac 
atccccacct 
cctccagccc 
ccccgccgcc 
aggcaaagtc 
agggctggct 
gccggcgctt 
cccgtggtgg 
cagaggcttg 
gtctgacttc 
ccaggatggg 
agagcagccc 
tagccaatgg 
gcatgacagc 
ggcttctgtg 
gtccttgtgc 
gacccctgag 
cctgggccct 
ctggagcctt 
gaaagttcac 
gacccaactt 
ggcctcccag 
ttcccactgc 
ggagagaaat 
tccttgttgg 
gcaaatggca 
tctcactgtg 



ctaagacagg < 
cagccgctcc 
cttagacatg 
caggaccagg 
aagcctacag 
ctgcgtggcc 
cccaaagatg 
gggctttgca 
cgggtgagtc 
gaagaggagg 
gggaggctgg 
ctggagggag 
tctggcccac 
agacataccc 
cgtggctggg 
gccgagagag 



gtgt 



tctcactggg 
tggagacaca 
ctcctcaaca 
gtgccctgtc 
cagagaacag 
ccccaaaagg 
gcagcactga 
cctccccgct 
gctgcccagt 
tctatccgtt 
ttagtgaaaa 
ggcttctcgg 
cctgccaccc 
ccccggcaga 
aggcaggaag 
tatcagggtt 
acacacaccc 
cctggccaca 



aggtccctta 
cagggaagag 
gcctgcctgg 
tgctgctcct 
agccttgtac 
ctggatcctg 
gatgtgcttg 
ctaaatgtaa 
gggcgacatg 
cagcccccag 
cacagccctg 
agagctgtga 
gagactgggc 
gaggcctcac 
agagagcttg 

cctgtacccc 
ggattggaga 
ccacccagcc 
gcaggaccct 
ccctcacccc 
gcacacactc 
tcccccaggc 
tgattccctg 
ggaaaacagg 
agacagatgc 
acataaatag 
aaactgccag 
ccctcagaat 
gcacccaggt 
ggacacagct 



cgccactgaa 
cctctctctc 
cagtacgagc 
tgggtcattc 
tgggctctgc 
cactcttccc 
acagggtgga 
gtgggctgca 
gtcctctgtc 
cgtgcctcgt 
gcaccggcgg 
tcttccttca 
aagcacctgc 
acagaggagg 
gggaccgtgt 
tcccactggg 
gaaagagggc 
caggacctgt 
gcctggtcct 
gccaagccag 
ctcagttcat 
ataaaaagtc 
ctaccaaatt 
acgccctgtg 
tgtttctttt 
gaaagccagg 
cattgcctgc 
ctgtgtgtct 
tggcaacctt 
cagcacaggg 
acgcgggcgg 
ctcccacacg 
gctgtgcctc 
ccgcgtgacc 
gtgagggcca 
atccctgcaa 
cagtaaggct 
tcaaggcatc 
atccgggggc 
gagcggggat 
tccacccttg 
gagaggcatt 
aacttacaaa 
atcatggcag 
tgcagggcaa 
gaacagcatg 
aacacatggg i 

ggatgagatg 
agcacagtgt 
tgcctgtcag 
tcagagctcg 
tctgccaagg 
ttgcccctcc 
atccctggcc 
aatctgcaat 
aggctccatg 
acatcctgaa 
ccacccccca 
cagggtgctt 
ccagcacctc 
tcactggaag 
cctggagttg 
tttatgtatc 



tggatgagtc 
tcctgtgcca 
ttgctgaaac 
aaggctgctg 
acacccaggg 
ccaggatccc 
gactgtgggt 
ggtggcagcc 
tcctggaggg 
ctgcccgtca 
tgtctggggt 
gcccagcagg 
ccctcctctg 
gcagaagatg 
tttgtttggt 
tgtgcagggc 
tttgtctgga 
ctctgtgtct 
ttcgcatggt 
ctgttgccga 
cctctcattc 
tgatgtctgg 
gaggtcaagg 
cccatggcaa 
tagtggagat 
ccaagggtgg 
caggccctgc 
gaaacagaga 
cacagagtcc 
ttggcttctc 
tgggggagga 
acagggtgga 
tgggctatat 
cagacgggcg 
ttcacatcct 
cctgagacgg 
cttgagatga 
cttgcaagag 
agagattggc 
gggccagaaa 
atcttggctc 
tgtattagtc 
agaaagaggt 
aagatgaaag 
ctcccgtttt 



gga. 



agao 



ccctggacac 
aatgcctggg 
gcctccggtc 
ccagcatggc 



tgtctgtgaa 
ccagcaactg 
agtccccgag 
aaggaggcac 
ctgggacacc 



ctgccagctt 
aaagggtccc 
tgtgccaggc 
tcccagttgc 
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116041 cactctgggc cctgaaaagg cccaggcaga tggaaagctg gacgcctgag gtcaccgtgg 

116101 gccctccacc cagctccggc actcactggg cctggtgccc ttccagtcag tgtgtcgatc 

116161 tcagcccagg gccacgtggg aacgattagg tgcatggcgt gggagctagc tcactgaagg 

116221 cccgagcccc tcatgcaggg tctgctgaca ccagacagga acatcaagca aaggggactt 

116281 ggccccagcc ccaagttctg ggatccagac ctatagcccc aggcccacgc tcacatccac 

116341 agcctgggcc ccaggcctgg agctaaggct ggacaggctc acctcctccc cgacagccca 

116401 ttaccaaacc cccaccttcc agcccagcct cacccctgac ctggcagctg gcctcaccca 

1164 61 gcgtcttcta aggccaatgg tttctgggct tattctgaga ttgaaattcc tttcctgtgt 

116521 tctgaatttc aagttctcat taaagatatg tggattgggg gtgggggtgg agggaggatt 

116581 cagcagaaga tgccaaacac cagcagaagc ccagaaaata aggcaaggcc ggccaggctg 

116641 tgggggaagg gctgccatgt tcaggaagac aagcaccata tggaatctgg gatcagagac 

116701 atggccaagg gcacgtggcc agctcacctc acctgagcag aggcccccag aaccaccctg 

116761 aatcttccag gagaggccag ccagaagcac caagcccaca gcacaagtga ccagcccctg 

116821 agctctcggg gacacagggg ttgctaatgc cactgggctt gcccgacagc agctagctag 

116881 caaggagcaa agctggggcc ctgcccccgg gtctggagca ctcgctcagc cccttccacg 

116941 ctggccctcc tagattactc ctccttcctt ccctgcaaaa tagtccaagg gtggggataa 

117001 gtcgaggctg agaccacagc tggaaaaggc tgggcaggca gtacccagtg gcccggctgt 

117061 ctgcagcata gcagacgctg atttgtcacc aaggaacaag gggccaccgg ttgcactgat 

117121 tggctccaga gtacattggg cttcctgatg gaacaattcc tttgcaaagg tatctgatgc 

117181 cgtttcatga aattagaagt atgcctgcca ggctgtaccc aatttttctg tcaagctagg 

117241 aagtcagaca gacctacagt aaatgaggcg gggctgctat ggagtgacgg gagtacctta 

117301 gtaaatcaca gcacgggggc tggcccagcc caggcccctg gagcactcgc tgtgaacgtg 

117361 caggtgtgcc cgatcctggg ccccagactc cagactctcc tcacccaggg gtcctgacca 

117421 gaggctgagg gctgagctcg cagcttccca ctctcccagc aagagccaag cacaggccag 

117481 gagtccctga ccacacgggg gaccctgaga actatgggcc tgcagaacca catcagacac 

117541 caaccctggc ttcaccacaa tccaggacca gagactgggg gatgctggga ggtgtgggct 

117601 ggcacagtga ggccagatct gaggcgacag tcatggactg tcaccatggc cgcctggcag 

117 661 ctcctttcac gtttctgggc ccagggcaca cgtgggttct gagttgaaaa agacagacgc 

117721 ccatgagtct gtgcagagac tgctgaggct gctgggagca cctccagagt ccttgtaggg 

117781 cacatgcctc cagggatcca gaaagagcat gcagggcctt tctccctgag cagccaggtg 

117841 agggttggga cctaccacag gcaccctccc agggaccatg gcagcacaga gcaggagagg 

117901 cgctgtctgg ggctctggag gcctcttcac aggcacagac tcagagacgc aggcactgga 

117961 ctcaggcctg gccgggggcc cccgccccgc tccacgacac ttgcttgcct ttaggcaaca 

118021 gaagatgggc ccttggacag ccttcatagt ccaccatgga ggccccagac atgaaggcag 

118081 aggtgtctgg gctgcttgtc acccacatgc acccagaaca gaggccccag ccccagcagc 

118141 cctgctacgc cctgccgtca gccaggcagc tggcccattt ctgagccact gagaccgggg 

118201 ctcagcctca gcagccaagc tggtctgtgg agagctgcct gagtctctgg gatgctcccc 

118261 tggggaccag ggcctcagag ggctctccac acagcctgcc cacctgcagt agcctcacct 

118321 gtgtccaggg cctgtggccc cccgagagga ggggacgtgg gtggccggcg atacaccaca 

118381 tgcacacggc cttgctcagc ctcctgcgcc gccagcccct tctccaaggg ttcgatgaag 

118441 aactcctcct cctccatccg gatcagacca gcctgcggga caaagacaac aggatcagat 

118501 ttccagcaca caaaagacca aggaagggga acagcgagga gaccccctca gggagtacct 

118561 cccacctgaa gaccctgccc agcaccccag ctggtgctca gcttgaaccc tgggctggcc 

118621 ggagggtatg aacatgtggt cccagaggcc gccccccgcc gcccaccctg tctaggaaag 

118681 gggcctctgt ctgggtcatc tccaccccta tacaccaacc ctgtctgctt tcagaaaagt 

118741 tactgtgttt ataaagtcaa aaccgtccag agaagcaggg agacattcaa gggcagctgt 

118801 gggcctgtgc gtgtccccag gagggcatcc accaaagtgc atcaatatgg gccaagtgct 

118861 gcccccactc ccaggagccc agagtccaag acaaaggtca aaacagccag ccggggattg 

118921 aatctgccca cacacatgtt ccatctgggc caaacaatat ttttaaagct ctgaagctaa 

118981 cactcagaat gatgagattt catgtcttga acatggaatc ccagctttcc ttggaaacgc 

119041 aggccctgac cacagcgggt cttggatgga acacatgccc ctgcaactgg ctccagtccc 

119101 tactgccccc caactgtctc tcctgctctc tgtgtcagcc accagcccag actctacagg 

119161 cagctgaact gctgcccctg accccctagg ccagccagca ccctccctga ctacaaactc 

119221 ctcgctgggt cctcatttcc gttgggactc aggctgctct atatgctggc attcaggctc 

119281 ttggcctgcc ccaccctccc agaccgcggc ttgcaccacc ccatgcttct cactgggctg 

119341 ggtgcaccct gccgagccac ccagcccctg aaccccatgc tgccaactca ggaatcccca 

119401 ttctcccacc ccaccacccc atccccagct cggccatctg gtaaacctgt gttccccctt 

1194 61 cgaggcctga ttcagacccc ctcacctctc ctgccctccc ctgccatcca gcctgccctc 

119521 ccctgccata ccctgccctc ccctgccata ccctgccatc cagcctgctg gcaccatcag 

119581 gtccacccat cccatgacaa ctccagctcc tacactttta tcttctggtc cattatccat 

119641 aaaccaaggt ctcacactgg aagtgcacac tgtgtccagc cttcagagtg tttccttcag 

119701 atcacaagct tatgacccat gcaggggatt ttaaaagctt gaatttgttg ccaaacttaa 

119761 gaaacaggag agttcacata aacgaccaga tttccagact ctcttgaaaa tcagaaaagc 

119821 tggcatttcc agcacccatc gacactccag cttagctggc gctggccggg gtcccctgtc 



131 



PATENT 

Docket 524593008900 



119881 
119941 
120001 
120061 
120121 
120181 
120241 
120301 
120361 
120421 
120481 
120541 
120601 
120661 
120721 
120781 
120841 
120901 
120961 
121021 
121081 
121141 
121201 
121261 
121321 
121381 
121441 
121501 
121561 
121621 
121681 
121741 
121801 
121861 
121921 
121981 
122041 
122101 
122161 
122221 
122281 
122341 
122401 
122461 
122521 
122581 
122641 
122701 
122761 
122821 
122881 
122941 
123001 
123061 
123121 
123181 
123241 
123301 
123361 
123421 
123481 
123541 
123601 
123661 



agttaggag 



ggggggcacg 
ccccatcctg 
gcctgctggg 
gcgcactcag 
gggcctggag 
gcaggaggtg 
agagagagga 
cggagcacag 
tgcccggtgg 
tggggggagc 
gctgcggaac 
ggggtaatgg 
gaggagcgaa 
agcacctggt 

caggtcagct 
gggagtggag 
tgccagatgc 
cccccaggag 
gccctgccga 
ctgtgagaga 
attcagggtc 
aagggcaagg 
taggaggcag 
agggctgtgg 
tcatccctgg 
tgagcgaacc 
caggcgggaa 
caggagttgg 

cggcagcggg 
ggccaggtga 
tgaggccggt 
agaaccaggc 
gaaaccgagg 
actcacacct 
tgagccaatt 
ggggaacttg 
ttctggaggg 
ctggccggag 
cctccctggc 
tgatgctgcc 
cacaggagca 
cagaagcctc 
cgcccgtctc 
ggcacctgcc 
gaccaaaggc 



agaccaaagg 



tgtgccctct 
ccccaaagta 
gtggcgcacg 
ttgctggctg 
cggctgcccc 
cctgggccct 
gaacacagag 
acatctaccc 
ccagtgctcc 
ctggtgggaa 
cccatcactc 
cttgatctga 
aaaaatgggt 
tccagacggg 
atggacagga 
gggcttccag 
gcagctgtga 
ctggggctgg 
ggggattagg 
cagctttgga 
ctggcagagg 
tgcctggatt 
acgaattcct 
ctggtgggtg 
ggcgcaggac 
tggtaaccag 
gaggagagcg 
gtctcagggc 
gatgacccca 
qaacacaqcc 
gagggtccag 
tcttatgact 
caagctgtcc 
cctcaggcaa 
catttcccgt 
cacgcctcca 
cctagatggg 
acacggaaag 
agggcaggac 
cagcccaggt 
acctagtgtg 
taacccgggg 
tgcctccctg 
ctgggccttg 
cactggggtc 
caagcggtgg 
tgttgaccac 

tgaccccccg 
gagaccaaag 
ctgacccccc 
gagaccaaag 



ggtgggcagc 
acctcatgat 
ggctgcccct 
ggagcctctg 
tagcctcaag 
cgggaagctg 
cccagcatca 



ccaaaggctg 
ccccgcgaga 
aggctgaccg 
ccgagaccaa 
tgaccccccc 
ccagagacca 
gaccgccccc 
ccaaaggctg 
ctgccaagac 
agatcatttt 
ctaaaacaca 
ctgcattttt 



ccaaaggctg 
cccccgagac 
aggctgaccc 
ccccgagacc 
aaggctgacc 
gagacaaaag 
acccccccga 
cgctggccag 
attccctgat 
cagaagaaaa 
ttccgggact 



atatggggga 
gagctgacca 
agggggctgg 
ttccagttca 
ctggggtcct 
agcatgagag 
cgtggtaggg 
gtctgtggct 
gggtagtgga 
ctgggccctg 
cagatgcgca 
gattggagcg 
ccagaaagtg 
tgggtcaggt 
gctgttttaa 
gccggcaaag 
cctctgggct 
agttaccagg 
gaagaggggg 

aagctgcagg 
aggcctgtcc 
cactgatggt 

tggagegtga 
ggtgtcaagc 
gcacactcac 
cccaagtgat 
cagacaggct 
ggcatctccc 
actggggcat 
cgggagccat 
acaatgaggc 
gacagtttgg 
aggacaatca 
cctcctggaa 
catctgaagc 
cattctccaa 
cgtgcgatgc 
aggctgaccc 
cgagaccaaa 
gctgaccccc 
cgagaccaaa 
gctgacaccc 
gagaccaaag 
aacccccccg 
caaaggctga 
cccgcgagac 
aaaggctgac 
cccccgagac 
gctgaccacc 
gaccaaaggc 
aataatgctc 
tttcaaacac 
ttgacaggag 
ttgattttcc 



ccccggcacc 
ggcctctgcc 

cagaccgaga 
tgtagcgagg 
atgtgccagg 
ggcctggctc 
aatccctttg 
aaccgcacat 
ggtgggtgga 
tgagggagat 
gaagcatggt 
ggggagaagg 
acaaggacga 
tagagccggc 
gaagaaccgg 
catggtggaa 
aatgcagtcc 
gaggacgagg 
atgtggcctc 
tcctccccca 
tataccgact 
gtggttcctt 
tgggtgccat 
caggcatcct 
gcaagaccga 
tgggaaccct 
ctggcgaaga 

actggcacag 
cccggtgcgt 
ggatgaagtg 
agttctgact 
tccccaagcg 
acaggagcac 
ttcatccagc 
gctggggaca 
agacaggtcc 
gaggacccct 
cgatgccaca 
caggctcagc 
ggtctccctg 
ctgggctgag 
gggcacccct 
agggcctggg 
cacagacgtt 
gcacaacgca 
ccccgagacc 
ggctgaccgc 
cacccgagac 
ggctgacccc 
cccagagacc 
gctgacaccc 
agaccaaagg 
cccccccgcg 
caaaggctga 
cccccacccg 
caaaggctga 
cccgagacca 
tgaccccccc 
ggggcggggg 
ttcagtatat 
agcaaagcac 
tgctgctagt 



tgccatctgg 
atatcatcat 
gaccccaggt 
tgcagtcagt 
gaggctggag 
caggggctgg 
cagaatgaag 
tgctggccag 
gccatggccg 
caggagcatc 
tgtaggcagc 
ggctgcggca 
tgttacaggc 
gtggcaggta 
tgagatggga 
cccccaaagt 
agagactctg 
ttgcccactt 
ccatgggcag 
aagccaagga 
gagtctccag 
tcggacgtcc 
acagcagccg 
tctccgaggg 
gggcttctgg 
ggtcggagaa 
gcaggcccca 
ctcagcagat 
caagggaggc 
gggcagtgtg 
ggtgtggaga 
caggaattaa 
tcctggcacc 
tccagtgact 
atgcagcctg 
cagcttgctt 
cacaggttct 
ctgggccaga 
tatggcagct 
ggcctgaggc 
ctggggaagg 
accagcaggg 
ccaacagcac 
gcaggcgact 
ctcaggcttc 
caacatgctc 
tgctggacga 
aaaggctgac 
ccccgagacc 
caaaggctga 
cccgcgagac 
aaaggctgac 
ccgagaccaa 
ctgacacccc 
agaccaaagg 
ccccccccga 
agaccaaagg 
cccccccgag 
aaggctgacc 
caagaccatc 
gcacttcacg 
gcatcccagg 
aaaatgtgag 
gtcaagtgta 



atgtgcggaa 
cttggttgtg 
gctcggtgag 
gccccccagg 
atgcctcccg 
tggaggtgca 
gatgcctccg 
gcaaagagtc 
gggaggtgga 
aagggcacgg 
ccagggcttc 
gggggaatgg 
acctgggaga 
tgaaggaggt 
gggaggagga 
tacccaggcc 
cgggtgtgat 
ccttatgccg 
acagggagga 
atgctggcag 
agggagcgca 
ggcctcagaa 
cagaaaagta 
gcatgagaag 
ggaagaggcc 
ggaggtcagc 
cgcttctgac 
gcctgtggaa 
accgagcagg 
tttaccagtg 
aggctggtgc 
ccctgggggg 
ctttctatct 
cgtctgtaca 
ggagacactg 
tccaaatgag 
agggatcaga 
ctgggccgat 
ccatcttctg 
cctggggtta 
gcagtactct 
gtcaggctcc 
tagctccatc 
cctcggaggg 
ctcacacagc 
ctcacagccc 
tgaacatggt 
ccccccccga 
aaaggctgac 
caccccccag 
caaaggctga 
ccccccgaga 
aggctgaccc 
ccgagaccaa 
ctgacccccc 
gaccaaaggc 
ctgacacccc 
accaaaggct 
ccccgcgaga 
agctgacgtc 
gaagaagtgg 
aatgatctgt 
ctatgttttg 
tctgttttta 
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123721 catgtaattt ttattaatta gaatcagttt cctctcccca ctcaataaat attaaagaaa 

123781 ctccagaagc acactaggct gaaaagcact cagttgggag cctgctggtg gcctgtctgt 

123841 ccaaccccac agctgtccag cctggggtct tacagaaagg ttggtacccc gacctggctc 

123901 taagtgtgag tcccgctctg ctcttgaagt ctgtggatgt ctggcaaacc cagccaggtt 

123961 ctcccctggg ggacagagag aggatcatgg ggagggctct gtccctgacc ttcaggtgcc 

124021 accgcagggg cccagagcct ggtagtaccc gggtcatcaa gtcaagttgc cggcattcag 

124081 atcctggtga gttacacaac gcctgccctg cctcttcctt ctatagaacc tgctcctttg 

124141 cagttggatt gatggtaaag tagttactag aaatggcaac tagatgttaa aaatttataa 

124201 agatccaaag gcttaacagc tctcctctat tactctgaat ttttaagtat tcaaccacag 

124261 acagcctgga agttaggggt tcacaggaaa cacaagcatg catcctgttt ccttctgcag 

124321 agcacccagg ctggctggtt cccactgccc cttcccatgg aagccacagc tgcagccatc 

124381 agcagccccc cacctgtgta tgactgggac ctggtctgtc actcaaccct gtcaacctcc 

124441 agcataaaca ctgtcaggtg tgtgccaagg ttgcagtggc aacaatcaag catagaaggc 

124501 acaaagcata ggaagcacac aggtccatgc acagagggac ccagactttg ccaagggaca 

124561 aggctgcaag tccctgttgc cctcaaactc acctccctcc gaaatacaca cacagcacgc 

124621 accttgccaa gagtccccca gatgctggca tctgctaaga ttcccattcc tgaaagtatt 

124681 ctaactccaa gattctttct ccccatcttc cttttggtaa tgaacctggg agatctgctg 

124741 gacaacgggg caatggccag gcagcctccc ctgctgtgac agctgcctgg tggaatgacg 

124801 gtctcacagg gagagctgca cagtgcctgc agttcccagg gcgcctcctc tcagagcctg 

124861 ctcgggcact ctcttgcctc tcccagctcc ccagggaggt ctctaacaac cttgggagct 

124921 ttcagggaag ggggctggac ccctcactac tgaacagtcc caggagtggg aagtgtcatt 

124981 gagtgggatt tgagccactc cgtcctcacc actggacagt cccaggagtg ggaagtatcg 

125041 agtgggattt gagccactcg gtcctcacca ctgggggctc ctttacagtt cagcatccca 

125101 gaggatttca cacactgact tttccacttt tccaaaaggg acgccagcgg gcaggagctg 

125161 tccatggggt taggaaacag ctgtcctcac ccgcatctcc accagctcct cctctgtctc 

125221 atgggatcct cgccaggtgc cagccactcc cgtccttcct cttggagagc gaccccttgt 

125281 ctagggctct gtcttggagc tatggctgga gctgggcctt ccagggttct gctgagcaca 

125341 gtgaccacaa cagagttgcc tctgagctcc acagaacctg atgactttgc ttggccactc 

125401 taaccttccc tggacctggt ttcaggccct acagaccttt actccagaaa gtctcttttt 

1254 61 tggcctaagc tggccggagc caatttccaa caggaccaag gagccccgac agtcacatgc 

125521 ccagcgaggc tctaaggaaa gctagactcc actttgaagg gcagggtggg gggcgaggca 

125581 ggcagggaga gtaagcactg gacgctgagc caggggccct ggtttccaat gcggtcatgg 

125641 ccagtcctgc ccccttgcag gcccccagtt ccctcatctg caaactgagg agctggacta 

125701 cgtaacctat gaattactgc ttatcctgtt tattcatcac cccaaaaaga aaccccgcac 

125761 acgttggcag gtaccgatca tccttctaaa actctactag gaccaaggag aaagaaccaa 

125821 accaaaaacc agaaacccac tcctgtaaaa taaaggccaa ggccgtccca tggcacgcaa 

125881 ggccttccat ggggtcccag cagccctctg tccctgggcg cacccccagt gtcttcctct 

125941 ctctccgggc ttctatttgc acctcaaagc ctacaccact gcccttctct cttgttatag 

126001 aggtcagtaa gtagcagaga gctggtaaag atctgtcagc accaaactga gactcccctc 

126061 gactcaacca gagagtttaa aatgcaactg aaaagggaat catttttcat tataaagttc 

126121 aatgcttttt ttcttctttt tttgagacag agtctcgctc tgtcacccag gctggagtgc 

126181 agtggcacaa cctcagctca ctgcaaactc tgcctcccat ttttaagcga ttctcctgcc 

126241 tcagcctccc aagtaactgg gattactggt acacaccacc acatccagct aatttttgta 

126301 tttctagtag agatggggtt tcaccatgtt ggccaggctg gtctcaaact cctgacctca 

126361 ggtgatccac ctgcctcggc ctcacaaagt gctgggatta caggcataag ccatggcgcc 

126421 cggccagttc aatgcttttt aatatataac agctttatgg tgctgtgatt accattccat 

126481 acatctcagc catctaaact atatgctgaa agtgaacaat tcaaccataa ccactattga 

126541 ttttagaaca ttttcatcac cccaaaaaga aaccccgtac acattggcag tcactcccca 

126601 tgttttcccc aacctctcct ccccaaccca gccctgggca accactaatc tacattctgt 

126661 ctctatgaat tggcctattc tggacatttc atatacgtgg aatcacatgc tatgtggcct 

126721 tttgtgactg gcttccttca tttagtatgt tttcaaggtt catccatggt gtagcatata 

126781 gcagtatttc attcctcttt tattgtcaaa taatacccca tcacatgtat ataccacatt 

126841 ttatttatcc atttgccggt tgatggacat gtgggttgtt ggtttttggc tcctatgcat 

126901 aatgctgcta tgaacatctg tggacaagtt tttgtgtgga catgtgtttt cattcctctc 

126961 aaagtggaat tactgggtca tacagtaact gtgccagact gtcctctaca ggggctggac 

127021 cattctcagt gcccccagca gtgtgtgagg gttcaatttc ccctcatccc caccaatact 

127081 cattattctt aggctctttg actctagcta tccagcaggt gtgaagtggt gtcccatcat 

127141 gcttcggctt gcatttccct gacaactaat gatgctaagc atcttttcat gagcttatag 

127201 gtcatttatt taactttgga gaaacatcca ttcagatact ttccctattt ttaataagac 

127261 cattttcttt ttattgacct gtaagagctc tttatatatt ctagacacaa atcccttatt 

127321 agatatacaa tttatgaata ttttctcttg ttctgtgggc tgtcttttca actttcttat 

127381 tagtgtccac tgaagtcatg aaggactttc ttgaagtcca gttcacctat ttttttcttt 

127441 tgtccacttg tgcttctggt gtcatatcta agactttgcc taatccaaag taacaaagat 

127501 ttactcctac attatttaag agttctatag ttttagctct tacatgtaga tctatgatcc 
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127561 
127621 
127681 
127741 
127801 
127861 
127921 
127981 
128041 
128101 
128161 
128221 
128281 
128341 
128401 
128461 
128521 
128581 
128641 
128701 
128761 
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128941 
129001 
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129181 
129241 
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129361 
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129481 
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129661 
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129841 
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131041 
131101 
131161 
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131341 



atccatttac 
tatgtggata 
acttgtcttg 
attctaccaa 
tttgtagtaa 
cattttggtt 
ttctgccaca 
tacccattag 
gcattagaaa 
tgggaggctg 
gtaaaaccat 
tgtgagcaac 
agagaatgcc 

aaatcacagg 



taatcttcag 
tacaagctga 
ccttccaaat 



agttaatatt 
tccaggtgtg 
gacctttgtc 
tctatatgtc 
gtcttagaat 
agtctggatc 
aagctagctg 
ttcacctcct 
ccaagggctt 
aggcaggcgg 

tctgggccct 
ctcgagagcg 
tttacgcagc 
tcgaaatagc 
gaagacattt 
tttaacttta 
aaagagtaga 
aatcttcagt 
agaattctac 
gactgaaggc 



tgtgtatggt 



gaacaaatat 
ctgaggcagt 
aaaatacaaa 
tgaggcggga 
ctgcactcca 



ataacaataa 
taggaatttg 
attagccaga 
gaatcacttg 
gcctgggaga 



aagaatcaat 
tatccttctg 
tacaaagcat 
ccttgtattt 
ggatttgata 
aggccaatgt 
ggtcaggcat 
atcacgaggt 
gatgggagcc 
gggagagggt 
tctgtctcag 
ctggacttga 
ggaattgccc 

ataaatgggg 
ggaagaaaaa 
tagcatcaac 
tcaaaattag 
agaccatata 
tccagaaaaa 
atgtaggctg 
agaccaggct 
cgtggtggtg 
aacccaggag 
cagagacaga 
tggtc 



r ggagacagag aqatac 
t cgtccataat 



gtgaggaagg 
ttgttgaaaa 
atgaggatgt 
ccaacaccac 
gagcccttcc 
ctgtatgaat 
gaattaaatg 
ttaggaaaca 
ggtggctcac 
caggagattg 
ctaagtctat 
gcacagaggg 
ttctgcttgg 

tctgcatcta 
ctggaaaaaa 
gcaggcaaaa 
aggaattaca 
agttactggt 
aaaagtcaca 
gaaaatatgg 
cagaaaacat 
ggcatgatgt 
ggccaacatg 
cacacctgta 
gcagagttac 
ccctgactca 



acagtcttga 
tctgttcttc 
tttaggatca 
cattctggag 
ttggcctaca 
accctgtaat 

caactaacca 
atggcttcaa 
catccctccc 
gagacacctg 
tgctacaggg 
aaagctatgt 
aattgccaaa 
gtttaattgc 
aagtttctaa 
actagtaaaa 
tagactagaa 
aaattagatg 
gatcccagaa 
gtgaaacccc 
atcccagcta 
agtgagccga 



aattagccgg 
agaattgctt 
cattctgggc 
acaaaaacaa 
gtcacccagg 
ccaggttcaa 
caccatgcct 
gctggtctcg 
attacaggtg 
tgcaaaataa 
tttgaccact 
cccagctgca 
acagtctaat 
tcacctctga 
acagggtttc 
cagcacgtga 
cagctgttag 
tttttattaa 
aacagacaat 
aagggggaaa 
gccagccagg 
cctgtccaga 
tgcacacacc 
ggacacagag 
tgacgagaac 
cagcatgaag 
agggcgctct 
caaaaagcaa 
gtggccttga 
agatcctgcc 
ccgaggcagg 
accctgtctc 
cagctactcg 



agaccagcct 
gcttggtggt 
gaacccagga 
aacagagtga 
taaatggtca 
ctggagtgga 
gcaattctcc 
ggctaatttt 
aactcctgac 
tgaaccaccg 
ttttacaaaa 
taaggaacag 
gcatgacctc 
aattaagcat 
ctgagtggcc 
catggctgat 
taaatcagcc 
gaaacgcgta 
tattttctgt 
tcatgtgaag 
atggaataaa 
cggccttctg 



itg 
aatgccgggg 
gtcagtgggt 
tgacactcgt 
cccacgtcgg 
ctactaaaga 
gatttttctt 
gacgtgtggt 
aacaggccgg 
cggatcacct 
taccaaaata 
ggaggctgag 



ggccaatatg 
acgcgcctgt 
ggctgaggtt 
gactccatct 
gattattatt 
gtacagtggc 
tgcctcagcc 
tgcattttta 
ctcaggtgat 
ttcccagcca 
tttacagatt 
acacacccat 

gtggcctccg 
tctctgtgga 
gtgagggcta 
gactggcagc 
acaaagggtc 
attttttttg 
acaggagaga 
caagatgccg 
ctaaatcaag 
ctcagccctc 
tagcctccag 
cagacccaga 
cccggccaga 
ggaacgtccc 
agaaaaatga 
tctgagcttc 
ggaggagagc 
gcacggtggc 
gaggtcagga 
caacaaatta 
gcatgagaat 



gcagtgagcc 



attattattt 
ctgagctcag 
tcccaagtag 
gtagagacgg 
ccgcccacct 
gtcagattat 
tatgctaggc 
gacatattgg 

agtcactctg 
cagtgtctgc 
agcgagatac 

agcagtgtta 
ttttacattt 
cggaaagggg 
acaagaatgc 
caacgcttga 
tgcagccagc 
gaaacagagc 
ctctgggact 
ttctctggct 
cagtcacaaa 
ctcgtgctgc 

tcatggaata 
tcacgcctat 
gtttgagacc 
gcgggacatg 
cgcttgagcc 



tattattatt 

aggcgaatgg 
gtctctacta 
gcttgggagg 
gagattgcat 

tggagacagt 
ttcactgaga 
ctgggattac 
ggtttcgcca 
cggcctccca 
ttcaataaca 
taaaagagtt 
tgaccgaaga 
tctctgtgcc 

agggttgaat 

tcatgatgat 
cgatcggggg 
ttacaaaggg 
gagagcacag 
ctcacggggc 
gtggctccct 
tcccccatct 
tggccagcca 
gcaacaggcc 
gacatttccc 
ttaacattca 
aggaacattt 
cagctggacc 
aggtccaact 
aatcctagca 
atcctgggca 
gtggtgggcg 
cagaaggcgg 



catttttcca 
tcccccactg 
tctcagttct 
ttactacagc 
tcctcaatag 
gcttgtcaat 
tcatgtttgg 
aagaaactct 
cccagcactt 
ggctaacacg 
ggtgagccac 
ccctgctgct 
cagcaccccc 
gcaagtgggg 
ataatgaggg 
aatatataaa 

actcgtttcc 
aatcagtaga 
ctaatgtata 
aaacagcaaa 
acagaagtaa 
ctctgggagg 
atctgtccta 
ctcaggaggc 
gatcatgtca 
aacctaaaac 
attttggaga 
caaccgggcg 



ctgaggcagg 
cactgcactc 

gtcttgttct 
cctctgcctc 
aggtgcacaa 
tgttggccag 
aagtgctggg 
ctctatgccc 
tctcagattt 
aggcggcaag 
aaacgtcagc 
aatgtttact 
ggagataata 
gcactgggca 
catggcatga 
tgagtttcac 

agagaaagat 
cagcgcaggg 
cgggacctgg 
ccatggctcc 
gacccagcca 
tgcctatttg 
aggtctcgtt 
cataaataac 
gccaaactat 
acagcagatt 



ctttgggagg 
acatggtgaa 
tctgtagtcc 
aggttgcaat 
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131401 gagctaagat cgcaccgctg cactccagct tgggctacag agtgagagac tcttgtctca 

1314 61 aaaaaaaaaa aaaaaaaaaa agaccctgtc gacaggccct tccctgggct ggcacctgcc 

131521 cacaccagct ggaacctaac acctttgcac agctgttccc tgccagcctc cctgcctcct 

131581 tggaggtctg cacacaccgc ccctcccttc ctccaaccag cccgggagac cagcacgcct 

131641 gagcaccagc actcccagag acagagccgg cacggtgggt ctgtctgctc agggggctgg 

131701 ggtggcaggg gcacatgtct gggagttgcc aacatatgct tccttaccag attccccagg 

1317 61 ggacccaccc tggagagcaa ccaaaccctg caaggctgca ctccaagaac tgagggcctg 

131821 gggacccagg accgtccccc agcattcaca gcaggctcac ccacatccac caggagatgc 

131881 tgcccaggta gactgccagc accccctccc acaaaaggtc ggggccggcg ccttccctag 

131941 gttgctcggg caaccatcac agggcagcac aggccggggc ttgaatagca gaacctattc 

132001 tctcacagtc ctggaggcta gagctctaaa attaaggtgt gggcagaggc ggtttctctc 

132061 gaggcctctc tccttggctt gtcgacagca tctttcccct gtcctcacat ggccgtcccc 

132121 ctgtgtgtac ttctgtccta atcgcttctt cttaggacac cagtcattca tgttgaatca 

132181 gtgacttcat tttcagttaa ttatctcttt aaagaccctg cctccaaata cagccaatat 

132241 cctgatctac gggagtttag gacctcaaca aaggaactgg gggatctagg gagggcgtca 

132301 ctcagcccat aacagcctcc aggcgccaag tccacacaga atcggagcag gctgctggca 

132361 gggtccggtg gccagagctg ggcgggctgt tccttgggga gccactctgg acaggccgga 

132421 aggtcaatgc caggatcact ccccaggggc cagaaatcca gcatggagga aagctgccac 

132481 tccagggaag taggagcccg cccagggctg gccacatggg atgcccatct cacccttcct 

132541 tctaaactgg gcattctcac atggggacac ggggcagtga gtgcaggcgc agggagacga 

132 601 gaatgctgca cagccggtag gtggtggacg ggcctcagga cccctccgcc agcagcaggc 

132661 cactctctca cggtccagca ggagccctgc tccagagccc acgactggag ccagctgagg 

132721 ctggtgtctc acattcctgc agagaggctg gctaaggctc ccgtgtgggg gcactcccca 

132781 tgcccactgc accccagaat tttctgggcg atcccaaggc tgagtctacg cagctgttgg 

132841 aagacggggt atgcaattct agatgtgtgg cgtgttttga acagcatcca ataccctttc 

132901 tgtggcagct cagcccggag agcaggtccc cagtctcgag gacccaggag gcagggtctg 

132961 ctccttccca cagcgggtgc tcctccagga ggggccaggt gctcagaggg aaaccagtgc 

133021 ccaccaaggg cccagctgcc tggctggctc ccagcagtgt ggggaagaca cactacccga 

133081 cggacgtccc ccttagaagc aggggcagac aggaactccc atggggagag cacaggctgg 

133141 gacgcagccc gcccaacgcc agtcccccca ctccactgtg caacaaggat cagcctttcc 

133201 tctctcaaag cctcggttcc cccagagggg gctgcagttg agagttgatg gtgagactca 

133261 gtggggcatg tcctccatgt ctgcttgtgc aaagctgccc agatcaggca gttttcccag 

133321 ccagaggacc ccccctttca ctttccatca caggctgtca tctccccggg tccctgggcc 

133381 acaggcccct ctcaggaccc ctccggcctc atgcacacac agagcctgtg cactgcctca 

133441 ctcctgcttc ctttgaattg ggcttctttt cctgctgtga tctctggggc ccactgtgct 

133501 atccagggca cgctgaccgc cgaccgtcct ctgcccgcct gcccagctgc cactcccggg 

133561 cgggcagccc tgccaggccc gaggctgtct gcttcccctt ctgtccagag ttcccaaaac 

133621 aaaaatcagg ccataccatc tgacgaagga tcctgtctgg tttatttatg ggaagagcca 

133681 gggctctgag ggccaggttg tctgggctgt ggccaggctg ctctgcaggc cttcagacac 

133741 tgcccagccc gtgtccccac ccacccaggg tggctccggg ggaggggcgg gggcagggcc 

133801 ccagcactgg tgagcccaca ctggtgactc tgggatgtgt atttattcta gagccaggac 

133861 agcagccgcc agggccctag gcccctctgt gggtggttgg cccactggag agccggagca 

133921 gcgaggccag caatgaccaa aggcgccgta tccagaatgc aggcaggctc aggcgcacct 

133981 gccctccctg gatgccacta cagattcggg gagactgaac tcatgggatc aagtgcccca 

134041 cagatcctgg tcctcacaag ggacaaggcc accaggtgtc acactgggag acagtggcac 

134101 cccagggtat gaaaggccac accaggctta tgtccagtaa ccctgcctgc acagcacagc 

134161 acggcacggc atccataccg cagcacagcc ccgccctccg gcagtgccaa gcccgggatc 

134221 tcagggcccc caccagaaac cagacaggtg agcaggtggc agctccaggc caggtgctga 

134281 ccttgtgcat accaggcctt ccccgagagc aggctgcagc cccccagctg gttctgactg 

134341 ccacaggact gcgcccctcc tgcagggctc tgctcacacc cacactccct gctcagagac 

134401 ggcctgtccc accgccccgt gtgaaatcac acccaagcac agcggaacga tggaggaggc 

134461 agctggaagg tgctttcggt ggtcaaatct ggggcaattt gaacacaata ataaatgata 

134521 gtaataggtt agaacccatt gaataaaata aatatccaca gatcagtatt ggtagaaaca 

134581 aataactgaa taataaataa gtgggagggg aagggcagct ctttctcaca gggtaattca 

134 641 actaaaaatg taaaagaaat gatgagatag aaaatcatgg tagtaaaccc catcggcaac 

134701 agcacagtca ggactgcggc aggtacgact tatccgtgga ggctacaatt agtgagcaaa 

134761 aggataacaa gaaacagaat gctcacatca cgtcacagca gctacccacg ggagatttaa 

134821 tcatcgcaat gggaaaaaag agtgaaagag tggtttgctc atcaagccac tcatctgaga 

134881 cctgaaaaaa cctcacagtc acagccctag agctcccggg gcatccagtg gaatcccagt 

134941 aggtaacacc ttagccaaat gaagaagggc atcactgaca ataggagaca actggacatc 

135001 ccaatgtgac agtcccagcg tggtgcccta cggagggcac agtgcttctg tgacagtcct 

135061 gccagaaaca cgtaagctca agccagtaca aggcacacca gaaaaaccca tggggatgca 

135121 ccgtctacaa aacagctgac cagtcctctt caaaaacatc caggtcatga aacctcagat 

135181 gggaggaaac gggagagaca caacaactaa atgccacgtg ggatcctgag ggggagcctg 
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135241 
135301 
135361 
135421 
135481 
135541 
135601 
135661 
135721 
135781 
135841 
135901 
135961 
136021 
136081 
136141 
136201 
136261 
136321 
136381 
136441 
136501 
136561 
136621 
136681 
136741 
136801 
136861 
136921 
136981 
137041 
137101 
137161 
137221 
137281 
137341 
137401 
137461 
137521 
137581 
137641 
137701 
137761 
137821 
137881 
137941 
138001 
138061 
138121 
138181 
138241 
138301 
138361 
138421 
138481 
138541 
138601 
138661 
138721 
138781 
138841 
138901 
138961 
139021 



aaccagacaa 
ggttagaatg 
ggcagtgttg 
tgagtggctt 
tagttcccga 
cacatgggct 
ccagaagcca 
ttcttttctt 
aaaactaaga 
cttgttctac 
ttttgcaact 
aaaactttaa 
gtccattctt 
tccagggagc 
gccctgagaa 
gagcaatgct 
ttcagggcac 
agggctgttg 
ctcactggcc 
tgctgacacg 
gggatacagc 
ggggccacct 
tgtgcctctg 
tccgggcagt 
tgttgccctg 
catcatgctc 
ccctcttctc 
ccattaaacg 
gatgaggctg 
tatttattta 
caaacacagc 
cccaagtagc 
aaatttgtag 



cggacatgag 
tgactggttt 
gaaggtgggg 
gatactgttc 
gcctgggttg 
ttccctttga 
gggccatgct 
tataaattac 
cactaagtac 
gttgcataca 
tttcttgcaa 
acacataaaa 
cacattggct 
aggtcctgta 
ggaagcaagc 
cttgtgcaga 



agt. 



eta 



tggcccaaac 
cactctgggc 
gggaacaacg 
ccacaaccag 



ccccagctgt 
acaggaagag 
acagtgagca 
ttgetattec 
atagacagga 
gccatgcctt 
cagcttgccc 
aagcagggee 
tcctgaggtc 
ccccaatagc 
caaagagaag 
ggcttgggac 

gaagccgctc 
aaaataagga 
gctaatcttc 
gctgactccc 
ggcttgaagt 
gtcagagccg 
tgtgccaacg 
aggecagaac 
tccagggtag 
ctaggctgga 
gtgattctcg 
agctaaattt 



actatgaggt 
aggaggggac 
tgggttttga 
tgcccaagcc 
ccgggctctg 
tcactggtgc 
cttccagggc 
caatgccacg 
tcctcccagg 
ttgtcattct 
aggttgaatt 
caagatgtgt 
tttatttaga 
tcactgcagc 
tgggaccaca 
aaatgaggtc 
ctgccttggt 
attaaattta 
aagatccaca 
gaagceggea 
cactaaatac 
caggttccaa 
gggagatata 
atagctcctg 
aatgtgttcg 
ccatgtgggc 
gtcctgeagg 
tgggaacagc 
aaggtgactc 
aatccaagct 
ctggctgggg 
ctgccttgaa 
ccctcctgtt 
atcctctgtt 
tggtgacagc 
ctccttgtcc 
acgttattgg 
acaaaagagg 
acatgagcag 
gaaaactgac 
tgegctgage 
cctccgggcc 
aggccagtgc 
ggttttgttt 
gtgcagtggc 
tgtctcagcc 
tttttttttt 



teggaagact 
gttcccacca 
cctggggtga 
tcaaggtagc 
ttataaaagc 
ccctctccac 
cttgaacttc 
ccagtctcag 
caaegcttag 
gaggaagctg 
tttttccaag 
ataaaactgg 
gcctttccct 
gaaggtggta 
ctggcgtgtt 
gggcctatgc 
tctgccctgt 
tgcttcaggt 
ttccttcaca 
ggattcccac 
teaagectga 
gtgttcaggg 
ecagggeggt 
tggctgeate 
acccttcccg 
ctggtctgca 
gcctgctggt 
ctgggagtgc 
ggttcttccc 
gacaggatct 

gacgtgtgcc 
ttgctatgtt 



acacgaccag 
ttggtggtca 
ttcccggggg 
atggggctgg 
agagccacac 
aatatgaaat 
agectgagea 
tgcttctcct 
tcatgaatct 

ctagaaggac 

cccaccccac 
ccctccaagg 
ttgtcttctg 
ttcctagagg 
gttgggggga 
cacagcttct 
aeggggaaga 
tgttgctttg 
cagcggcctg 
cctctgattc 
atctcaatgt 
tagacagata 
acagcccagg 

gtttgtttgt 
atgatctggg 
tcccgagtag 
tgtatttttt 



ggtatctgaa 
aagggaattc 
aggacactcc 
cacgttatga 
ccagcctgca 
gtattccatt 
tatcaaccta 
ggtgaagggc 
acagcataga 
gccccagcct 
ggagtggagg 
ggttccaggg 
ccaggaacac 
ctgagccgcc 
geatggggee 
ctcagtgagt 
ggtgtgaatg 
tcacatcaga 
gtggcacttg 
gtgggcagtg 
ggcccacacc 
cacctgcttg 
tgtcctctct 
gagggatgga 
ggcagagtgt 
acjaagtaaca 

tactgtgtca 
tgggctcaag 

gecagggctg 
gctgggatta 
atgaggacgt 
tcattaatga 
cagagagaag 
ggaatgcggg 
attgtgagcg 
tagctgggac 
gtaacgaggg 
ccccgtcccc 
tgttataaat 
ccagggcccc 
tgtcaggaag 
caagatacat 
cctggagtgc 
taccccattt 
cttaagtgag 
ttgtcaggag 
gagggtggga 
cactcacagc 
cagtctgacc 
tcctgacaga 
tgccttcctt 
ttgeaggett 
gaagagtatt 
gggaacatgg 
acaagcgtgg 
ccagcccaag 
ttttctcaag 
ctcaccacaa 
ctgtgattac 
tagtagaatg 



agtgcagtct 
tgacatttgg 
tcatggaggc 
ttgctctgga 
ctgggtttcc 
cacagcacaa 
gaaccatgag 
atagcaacac 
caaagttcct 
acagaggaaa 
tctaaagtta 
ccctcagtgc 
cgcccactgc 
ateggggage 
cagggagaac 
ctgtgggccc 
acggatgccc 
ggcttgatga 
ggactgagee 
cagaatgagt 
tgggctcctg 
gcagcagaca 
ctcaaaggtt 
gagtgettte 
tgcctccaag 
atgcctgtca 
ggaaggcggg 
i gcctcaggac 
i aggtccaaac 

I tgatcctccc 
i gctaattaaa 
atgttgaact 
cagatgtgag 
atggctcaca 
ageaeggtaa 
eggeggagtc 
gcaagaagat 
gtaacagctt 
agcacactcg 
tetctgeaag 
taaatcttcc 
tttttaatca 



tgtatc 



agaccactgc 
ccaggcctcc 
ttgaggtcag 
ctacgctggc 
agcatctttc 
actggtgtaa 
ccactttcct 
ctcacacgtc 
gaggaaagtc 
tgtttatgga 
ttccctgcta 
ccttcccatg 
gacatcagcc 
acaagcaccg 
atggaatctt 
cctccacctc 
aggcacacgc 
gggtttcatc 



ggagtgctac 
tccccagtgt 
cgatccctcg 
gaggcttgat 
ettctttgea 
aaaggectea 
ctaaataagc 
aaaatagact 
ggtttcagtc 
tetatgetat 
tttcaaaatt 
tctctggccg 
atccccagtg 
aagtacagag 
cgtgtagccg 

caggagctct 
gcgagccacc 
accagcatga 
geggggctet 
categgctgt 
tccttcactc 
ccgtgtcatt 
agagggaagc 
ccgtgctgga 
caactgggat 
agtgtgccca 
tgatggggaa 
gatatttatt 
tacagtggct 
aactcagcct 

cctgggctca 

ccaccttcaa 
aggagaaatg 

tgetattace 
accataaaga 
cactgagcag 
ttgatacatc 
aattgtctat 
gggacageca 
gggtgggagt 
ageccattgt 
ccagacggct 
agccccagcc 
ttttgecate 
cggtcccctg 
ctgcagccat 
gtgetaaect 
ctgctccccg 
cttaaaaagt 
agattaatta 
tcagaaatca 
actgcagtga 
tggactggag 
tcccattgct 
gagttggcat 
gctctgtcgc 
ctgggttcaa 



acattggcta 
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139081 
139141 
139201 
139261 
139321 
139381 
139441 
139501 
139561 
139621 
139681 
139741 
139801 
139861 
139921 
139981 
140041 
140101 
140161 
140221 
140281 
140341 
140401 
140461 
140521 
140581 
140641 
140701 
140761 
140821 
140881 
140941 
141001 
141061 
141121 
141181 
141241 
141301 
141361 
141421 
141481 
141541 
141601 
141661 
141721 
141781 
141841 
141901 
141961 
142021 
142081 
142141 
142201 
142261 
142321 
142381 
142441 
142501 
142561 
142621 
142681 
142741 
142801 
142861 



ggtggtctcg 
tacaggtgtg 
aggtatttgt 
gcacttttgg 
gtgaggcctc 
ggtcccagct 
gcagtgagct 
aaaagaaaaa 
gtcagagctg 
cctttggtga 
ggctccgagg 
gatgctgagc 
ggtccggaat 
ccaagccctg 
gtggcccctg 
tcctgctggc 
cctgaaaaat 
caacaggaca 
aaggaagagt 
cctctaggcg 
gtttccactt 
tcctcagagc 
aaaggaaacc 
caggctgatg 
ctgtcctaca 
aaggggggca 
gctgtggtgg 
gagttgtgcc 
atccgacatg 



aactcctgac 
agccaccgca 

gaggccgagg 
atctctacaa 
acttgggagg 
aagatgctgc 
atatgactgg 
acagcccaga 



acgggtgagc 

tgagagtcgg 
ccaccttcca 
ctgtgctggg 
ggagggaagg 
tggtccagga 
ggggaagagc 
aggatggggg 
ggtccgtggg 
gcaggggctg 
agggacgggc 
gcaaggaggt 
ccagcagtca 
aggaaagtaa 
cgggagacag 
cggataagga 
cggcctgggc 
gggtcaggag 
gacaagtcaa 



acaaacaagg 
aactgcaaac 
actaaatgac 
agctttacaa 
ccctgtcaaa 
gcagaggggc 
gacgacttac 
aattgatccc 



accaagaggt 
agccagaagg 
ctgaggcccc 
cttgggccct 



gctgcccagc 
acccatgcaa 
ggtggtctgg 
cccccgcccc 
ccccgtgtgc 
caagcggagg 
ccttcaattc 
actcttttcc 
cactgtgcag 
aacgggcctg 
caaaaccagg 
ctgagagcat 
tggctttgaa 
ttctgggctg 
gcccagtagg 
ccgacaggaa 
cccccagcct 
tttgtcaact 
ccctgtagcc 
ggtttgggat 
cagttccctg 
tgagcagggc 
ggtgggagga 
cacacgtcct 
agcaagggag 
ctcctccaac 
gactgagggt 
aaaccgagga 
aggcctgagc 
acctcagcag 
tgcttctgag 
aggggcaatg 
gggtgagcag 
ggaatgtcga 
gggaggaggg 
tggtacaggc 
aggaccgcga 
agagggcagg 
ggtggtgcct 
gcataaagat 
ccatcaatgg 
catgccgtct 
ccagaaagca 
tggaaactcc 
accaagggcc 
gggaaatgcg 
acagactcat 
aaccgaaagc 
ggcctcccct 
acctgcctgg 
gaagttggta 
ccctccccac 
atgtctccca 
tgcatcatgg 



cctgtgatcc 
cctggccaag 
cattccactt 
caggaagact 
ataatttaaa 
ctgaggtggg 
cactgcactc 
gaaccgggag 
aacacgtcca 
tgaacaggga 
agacccaccc 
gtctgaccca 
agattgggca 
cactgcagct 
ccagcttcca 
ggcttgcttt 
cctcattgag 
accccaaatg 

tggggaactt 
ccttgagcac 
ctggcaccag 
aagacacaga 
gcgggtctct 
cagacctgat 
gccagggctg 
ccccgccctg 
cctttctgtt 
cgaccaggca 
gggacttcca 



agaggagccc 
gctggaggct 
caaaagccct 
gcgtggagac 
ccaggacgcc 
cctgctcagc 
gctggcggtg 
cgtggggaca 
acgctttggg 
aatgctgaca 
ccggggcctg 
ggagcgggaa 
agtagatcct 
atggggctgg 
agggagcaag 
ggcaggtgga 
agggagaggc 
gggaagacca 
gcaaaggtgc 
gtttaacatt 
gtaacacatg 
gggtgtctgt 
tgttgtattg 
gcagcatctg 
aacggccact 
tgactcggga 
aaatgagtat 
gcctgcctgg 
caatgcagaa 
tggggacccc 
agcctgcccc 
ggaccctccc 
ggcataatgc 



gcctgccctg 
gtttttttgt 
aatgactggg 
gcttgagccc 
aattagctgg 
agaatcacct 
cagcctgggt 
aaagatgttg 
gccagccaat 
agatgggtgc 
tagggggtgc 
cccacgtggc 
cccacagact 
tccccgcagg 
aggtggcctg 
ctagggtcag 
tctccagtag 
ctgacagcct 
tggaggccct 
ggcgggttcc 
cctactggag 
gcctggctga 
gaagaaccag 
gttgggctcc 
ggactggctc 
gggagagaga 
caagctccct 
gatcccttat 
gccctcactc 
cactccagcc 
agactcagga 
cgtccatggc 
ctggaaggga 
tccatattcc 
aggtggggct 
ccactgcagg 
cacacagcac 
cctcggacga 
ggagaatgac 
tgatgccaga 
atgatgggac 
tctgtgctcg 
ggtcccaagg 
tcagcgaggc 

tacagcaaag 
gatgggctca 
ggcgggcaga 
gctgcctgcc 
aggccgaaaa 
gcagttagag 
cctgaaacat 
agtatcatca 
acaaggcttg 
ccatggttag 
cccagtgggt 
aatgggccgg 
tctttacaag 

acctgagatt 
cttctgcccc 
tttctcttga 
tcaagacccc 
caactcccca 



gctt< 
ttttttgttt 
aactggggcc 
aggagttcca 
gtgtggtggt 
aaaccctgaa 
aacagaaaaa 
atgaaggcac 
tatttttaat 
atggctagaa 
caggatcaac 
ccaaggaggc 
tggggagtgg 
gctgcaccca 
cccctgatgc 
caggaaatgt 
acttaggtca 
ctgaggcagt 
cctgtcgctc 
cactggccgc 
gctgctgggt 
ataaatagat 
gtccctgcgt 
ccaccccttc 
ctcctggctc 
cagaaggttt 
cgtcctgccc 
gtgtcccagg 
tcatagtcca 

ggctggggca 
ttcaggagca 
gctgaaacca 
gtggaaggag 
aggaaatgcc 
gtgccaaggt 
tgcagtgatg 
cacaggatcc 
gagccatgga 
ccgggagcag 
gaggaggggg 
ggggatgctg 
tggggcccct 
tcagcaagga 
aggaccagtg 
gtgggtaagg 
gagaggctgc 
tgctcccagg 
aaggacgggc 
atcaaccaga 
gctaaaagag 
aaatagaact 
gtgaaagcac 
aaacacaagt 
gacaggtctc 
attgatttct 
catgggggag 
ccaagcctgt 
gctgcttctc 
actccctgct 
ctgttccctt 
ggcttccctc 
cccctactga 



atgctgagat 
ttttgtttta 

gaccagcata 
gcatgcctgt 
ggtccaagct 
gacactttca 
gcagcctaca 
gaatgccctt 
gctctgtcca 
ccgacccagg 
agcttgtgga 
aagggccacc 
catccgcagg 
cagagaaact 
caacattttc 
tgtatacact 
ctaaagaagg 
tcaactactg 
atggatcatt 
ttgtctgtca 
tctggattag 
cccttccttc 
accttctgcc 
tccctgggtg 
tatttctccc 
gcagctctgt 
cccctgactg 
ggtgctggtc 
cctgaagcac 
caggctggtc 
gtcccgcaag 
gccaccaagg 
ctagtggggc 
gctgccatga 
tttctggctg 
gcagagccca 
gtgaagcagg 
acacctctgt 
ggaggtggcg 
ccggagctca 
aggtgggtga 
cctgcagagg 
gctcggaggc 
aagcaggagg 
gcatccgact 
tccccagggg 
tgcgtgctgt 
gtggacaggg 
cactggtatc 
ctgagatgga 
agacagagaa 
cagagcaagt 
acagtcacca 
ggctcaacag 
ctagaccaca 
ggatggatct 
catttacaaa 
cagccgtagc 
ggcaccaaga 

ccccaatacg 
tccagggtga 
tgaagtcact 
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142921 
142981 
143041 
143101 
143161 
143221 
143281 
143341 
143401 
143461 
143521 
143581 
143641 
143701 
143761 
143821 
143881 
143941 
144001 
144061 
144121 
144181 
144241 
144301 
144361 
144421 
144481 
144541 
144601 
144661 
144721 
144781 
144841 
144901 
144961 
145021 
145081 
145141 
145201 
145261 
145321 
145381 
145441 
145501 
145561 
145621 
145681 
145741 
145801 
145861 
145921 
145981 
146041 
146101 
146161 
146221 
146281 
146341 
146401 
146461 
146521 
146581 
146641 
146701 



ggcagagtgc 
gccgagacaa 

aaccgggtct 
ccttccctca 
cccatggcct 
ctctctctgg 
gtgccacagt 
atagaagctg 
gaacccaggc 
tggagatgag 
aggtcatgag 
agcttgtgca 
agcagccctc 
actgtgagaa 
agcctgaacg 
gggtgtggac 
gaggcattca 
ctggaatggc 
aaggtctgaa 
caaccactcg 
atcataaatc 
gaccaaagcc 
taaatgaagc 
tgttgataca 
agatggaaaa 
aaacacaaat 
aaggtgggtg 
catctctact 
ctacttggga 



ccccaacgcc 
cccactgccc 
tgcgtcttcg 
cttcaatcct 
cagcccttct 
ctgtccctcc 
ccgcccccag 
gaccctgcaa 
agcaaacacc 
tgtccccttc 
ctcttcctgt 
ggtggggacc 
ccctccactg 
accagacacc 

gagggacacc 
gggctctacg 
caagcccttg 
aagatgcaga 
aacatttgga 
gaagtttttt 
tttttaaaat 
accaccagaa 
cagtgctcca 
aatagcatga 
acggttagtt 
ttgtccaggc 
aatcacgagg 



=tg 



acccacattc 
actgacccct 
atggccagct 
gtcaggagat 
aaaaattagc 
aggagaatca 
actccagcct 
gaatccttgc 
ttagcaacct 
ccaggttctc 
gagactatct 
attattcaat 
aatgtaaatt 
aatggaaaac 
caacactttt 
gccagacccc 
gctttagcag 
agaagaatga 
gatattattt 
cgcaagccag 
cgcggagcca 
gatgtcccca 
gccagccccc 
ctggtcccag 
cgctcctgtc 
ccagcctggc 
acttgagatc 
acctctgctt 
gattgaaaat 
cagattggga 



aagtccacaa 
ggaagctcta 
cttggcttgt 
ctctgactct 
cacacctgta 
ggagaccatc 
caggcgtggt 
cttggaccca 
gggcgacaga 
cattacactg 
gacttccatc 
gggactggga 
caaaagggta 
gataatgacc 
attgtgaatc 

tggctggtct 



acatttctca 
tgtgcacact 
tatttaagag 
cagacagata 
gaagaaagcc 
cacactgagc 
tcctggcctc 
ctgcaagaac 
ccccaggaaa 
tgtcatccct 
tttagactta 
ttccgtgtct 
aagataatga 
aacaggctcc 



ctccactata 
accaccagct 

gaacaagaag 
cagcaggctg 
tccccagggc 
ctgctccggt 
cgcagattcc 
aggagacaga 
actcctaatc 
cccaaggtga 
ccgttaatgg 
tgtgcaaaca 
aaacccactg 
gttgtttcta 
tgcagagcct 
gaactgggag 
taggaaactg 
tataaatgct 
ttatactagg 
caacattcct 
gatgacagtg 
gcaagttcac 
gtcaagaggg 
gtgacaagat 
tgtatagctg 
gcggtggctc 
tcaggagatc 

ggagaatggt 
tccaqcctgg 
cacacacaca 
tgggtctcac 
aggagaatgt 
ggccccctcc 
gaacttctgc 
atcccagcac 
ctggctaaca 
ggcaggcgcc 
ggaggtggag 
gtgagactcc 
tgccatccag 
tgtaaccttc 
catggacaac 
ggatttcatg 
atatgttttg 
atgttttgtc 
aaagaatagc 
tttgtccccg 
tgtggctcta 
aagtgaagat 
caaaagtgca 
tcttcataca 
aaccgttatg 
aattaaaagt 
acaagcccca 
agtgaagcct 
cgcctactgt 
aaaaaaaaaa 
gaacatcctg 
aatccccact 
aaaatggcac 
aggcaaagga 
aagagatgac 



atcccagagg 
ccctcagggc 
ctctaaagtc 
actctgcttc 
gcctctgctg 
cccggggacc 
ttccacagca 
gtgctccctt 
ggcaatttca 
tgtggcaggc 
tggtgtgaag 
gattagtgcc 
cagcaagaag 
gcgccccgat 

gggatgctca 
atgcctagaa 
agatctgcgc 
gacaagaccc 
aatcaacgca 
ctaaatagct 
agaattcagc 
aaccttctta 
cagaagagag 
ctcacttctg 
taacaaatta 
acgcctgtaa 
gagaccatcc 
aggcgtggtg 
gtgaacccag 

cacacacaca 
tgggctaaaa 

tccatcttca 
ctccatcttc 
tttgggaggc 
cagtgaaacc 

cttgcagtga 
gtctcaaaaa 
gataatctcc 
actgcccctg 
tttgggagaa 
gtttttgctt 
tatgttcaaa 
agttgttgtg 



acggagagct 
tctccacaag 
ctctctcagc 

ctccctccca 
tccacaccct 
ccagaggcca 
cagaagccca 
gaaattcccc 
ctggccctgc 
cgggcctttg 
cttatcagag 
gccccagcta 
ctcacatgtg 

atgctagagg 
tggttagggg 
aatttaaacc 
aagcttaata 
cttaagttac 
acatcagaac 
attatagacc 
agcaaccata 
agtgtgacat 
ctgcaaatgt 
tcccagcact 

gcgggcgcct 
gaggcggagc 
aagactccaa 
catttattat 
tcaaggtgtg 
ttttccagct 
aagccagcac 
tacttttatg 
tgaggcgggc 
ccgtctctac 
gctactctgg 
gcctgagatt 



cttgctctga 
tcccttctcg 
ccaacaacag 
acaggacctc 
cgcaccaatg 
cctgctcagc 
gggtcccctc 
cctgacattc 
gaggctcggg 
caagcctcca 
gaaggtgatg 
aggcccaggg 
tgagcgagga 
agactccaaa 
ttgttatggc 
gtggcaggag 
ggttgagggg 
tggggtgtca 
cagagatgct 
acctaaaatc 
tggaatcaaa 
ctgtggatga 
agagtgaaaa 
aacctaaaga 
atatgtaaca 
agtggcttaa 
ttgggaggcc 
ggtgaaaccc 
gtagtcccag 
ttgcagtgag 



gcagtaagca 

ctttaaggcc 
tacaacttaa 
gctctggctc 
cggagaggtc 
aacacaggcc 
ggactcctca 
gatctgggca 
aaaagacagc 
ggctgaagga 
ctgtcacttg 
aatattacct 
agtgagctca 
atgactcttt 



ttatcttaaa 
ctatgtcacc 

tctccttaat 
agtaattttt 
aataatgttt 

tttgagctgc 
tgaggacttt 
ataaaggagg 
aagctccgaa 
ttcagtaaca 
tggcaaaaac 
ccgcaggaca 
tccccagtct 
ctctgtccaa 
aaggttctgg 
agagggcatt 
aagaattgag 
tggctatttg 
actttacaga 
gtatcattat 
caggattaga 



tgacagttc 
gggagggctg 
tttagaggct 
cggagcatct 
aatccttgtc 
ggatcacaag 
taaaaataca 
aggctgaggc 
atgccactgc 
aaaaaaagaa 
agtcaactaa 
tgatttattc 
gtctctcaca 
acttttctac 
aattgtgaat 
aattgtaaac 
gatgtctttg 
ctgaggacaa 
ggcacagcag 
ccctggttta 
taaacacgat 
gtctgagaaa 
tcttcatcta 
gaggccggag 
ggagccggcg 
gtgggaggaa 
aggctctgct 
gctcaggggc 
cccagccgaa 
gcctctttcg 
gttgtcaagg 
gcccatttta 
cagctaatga 
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146761 
146821 
146881 
146941 
147001 
147061 
147121 
147181 
147241 
147301 
147361 
147421 
147481 
147541 
147601 
147661 
147721 
147781 
147841 
147901 
147961 
148021 
148081 
148141 
148201 
148261 
148321 
148381 
148441 
148501 
148561 
148621 
148681 
148741 
148801 
148861 
148921 
148981 
149041 
149101 
149161 
149221 
149281 
149341 
149401 
149461 
149521 
149581 
149641 
149701 
149761 
149821 
149881 
149941 
150001 
150061 
150121 
150181 
150241 
150301 
150361 
150421 
150481 
150541 



gtggtggatc 
ggggcatctc 
gcacagatct 
gcatctcctt 
tggagacaac 
ccctgctgcc 
ccggtggcag 
cgagggcagg 
cttccaaggc 
gtcctggccc 
agtgagtagg 
gtatgaaaat 
tttgggaggc 
cagtgaaacc 
cctgtagtcc 



ctgcggaggc 
cttgtggccc 
ggccacaaag 
tcccgggggg 
cgtgaggaca 
cactggagct 
ggctccagca 
ttgaggaaag 
cgagggacag 
cgggggcaga 
tgcagtgggc 
atgttgtgag 
gcagggatct 
ctcaacacat 

agctgaggac 
aggttcagca 
aggcaggtca 
ctgttgttgt 
ctcatttttc 
cagcccacag 
ggtcctagaa 
gtagttgcag 
agatttaaca 
gcttggggaa 
cacgagtggg 
gaccaatgcc 
aggactgcac 
aacttgcagc 



ggatacagtc 
aaatacaacc 
atcttaaagc 
atggaaagat 
ttttagaact 
cagaatggat 
ccaattcaaa 
gaaaataccg 



aaacctacca 
cccacacgtt 
aagaaaaaca 
caaaactgtg 
agaactgtca 
taaagacatt 
gatggttgct 
acatcaggaa 
tgcgtatttt 



cggatctgct 
ttgagggctg 
ctctccactg 
tgtgaaccac 
ccatcctctg 
ccagcagtgt 
cagggcgtgc 
cccccagctg 
agccctcagg 
agcacagccc 
aagccacaca 
agacgggggt 
tgaggtgggc 
ccgtctctac 
cagctgctct 
aagcggggga 
agagctgcct 
agtgggtccg 
gtgcattcac 
gcaaatggca 
acagcacaga 
gaggcctaaa 
cagagaacag 
ggaagaaggc 
gagggcagag 
ctgcacagag 
agcccctgga 
aattgctggg 
gcctiatggct 
agagaggtcc 

aacccgcaga 
aactccagaa 
agctcaagga 
tttcctattt 
gcacctaaaa 
aggtggagtg 
gaagtgtgca 
aggagccgca 
ctggcatcag 
tctgacccta 
caggtcccag 
aggctttgaa 
cggaacccaa 
tagcatttaa 
caaaattact 
aacagataac 

agtctcagca 
gaaaaataca 
atgacagagg 
ctgcagcgag 
aaagatctaa 
atatttggat 
attcaacaga 



cgcacagggc 
ccctgtggcg 
ggcagcacct 
tgggcttaag 
ggaaaagaca 
cctctcatcc 
gggatgcccc 
cttgggaaca 
ccccaggctc 
ttccgccagc 
ggagcagaag 
gatggccggg 
ggatcatgag 



gtgcattact 
aagtccaaag 
acccagaatt 
ctcagatgaa 
gaaggaagtt 
taaaggaaga 
aaagaggaat 



gggcgacaga 
gtgataaggc 
cctcctgggg 
gcttcccaac 
atttgcgacc 
gtaagcaagc 
gtattgttga 
tgattggaaa 
cacatgcaga 
caatgtgagt 
tggtgggcct 

gggtggaaga < 
ggcaggatct < 
gtatttctgc 1 
ctaggagaag c 
tgcctgcagc < 
aaataacagc i 
gtttactctt < 
gcgggcactg 
tagaaaaagc 
cctttttctt 
ctctaagagg 
aaaccccatt 
agacagcagg 
ggaaactgga 
aaagttggtt 
aacaacacgc 
acgtgccact 
aatggacctg 
ttaacccaag 

tggcatacca 
aagactgaga 
aaaatactcc 
agaaacggaa 

aatgagtcag 

ctaacagtca 
aaataatggc 
tcggcaaacc 
tctgaaaact 
tataggggaa 
gaagtgcaaa 
ctatctccag 
gcaaaactta 
cttcagacag 
gcaacagaaa 
gataatgctg 



tgggctcctc 
cccagatgcc 
gagtaactgg 
tatgctgaac 
caaatgaggg 
agcgtgtgaa 
gctcacagtg 
gtcagcctgc 
cgggactccc 
ctctggccag 
ccgggacttc 
tgcggtggct 
gtcaggagat 
taaaacatta 
gcaagaccgt 

gtcctctgga 
ctccaccttg 
tgtggctgaa 
aaagaggaag 
ggggggcggg 
gaggcagcga 
ggacccgagg 
ggagtgtgga 
gggtggggta 
tggggggagg 



cccagccccg 
gcagggagac 
gggagagaag 
ttcagttttg 
caggacccct 
gaaggagtgc 
tgctgtgcct 



atgcagattg 

ggtacagaca 
atttccgaat 
ttcttcgtgt 
ggagcctggc 
gctctttcct 
gtcaataaag 
cagggctagg 
agaaactcct 
cataggttct 
gggttgtgtt 
acattgaacc 
ggttaattac 
aagtctcata 
agaaccagga 

ataaagtaag 
gacatatttt 
ctttaagaac 
tgaacttgag 
ttataatgcc 
tgtcccagaa 
tgaaaacttc 
tcaaatgaga 
atagacaaaa 
aaatggttca 
acattgttaa 



tagaatgtgt 
ggggaaaatg 
tacctgaaaa 
tctgatgtag 



tgcagaggaa 
cgtcaggaag 
tgcctctcac 
catgcctgta 
ggagaccatc 
gccgggcgtg 
ctcaaaaaaa 
ccaaactgca 
ccaggaagga 
ccagcagcac 
tgcagaccct 
tcgacagaga 
ggacttcctg 
tggagagatc 
tcagaacaga 
gagcaaaggg 
caggaggtca 
agattggatt 
ccctgtcagt 
ctctatgaaa 

cagtaggatg 
cactggattc 
gggaagaaga 

gacagagctc 
gctcagagcc 
tctcacacgc 
tgtgtagctg 
cttttggtcc 
ccccagcttt 
aagatggcag 
gggctcgtgc 
ggggactgac 
tacatgggat 
gcaatccaca 

gggtaatatc 
aaatctcaac 
gttggaatta 
ggcaaaaact 
taaaaaggac 
tcactggata 
gatagagcaa 
aagagcctta 
ggagaagaga 
tcaaattctg 
gaaatccaag 
aaaagtcttg 
gataactgca 
aatgctgaaa 
cttcaagaat 
ggccagcaga 
atgctagaaa 
aatataacag 
ttttccatgt 



cactgggaca 
aaggctgtgt 
tgggtttcaa 
acgcagaact 
gcaccatgtg 
agccggagag 
gggagaggag 
cctgggtgcc 
ctgcaagcca 
gggagtgtag 

ctggctaaca 
gtggggggcg 

gcaagattct 
gccctgagga 
cgatagatgt 
gggtgctatc 
gtgacgcagc 
gaagagatga 
tggggaagtg 
cctgccatgt 
tgggcggggg 
gagtggtggg 
ttctcctctg 
tgctctgggc 
ttcctgttca 
ttataaaccg 
gtgtgagcac 
atgagcagct 
ctagaatgtg 
cctagataca 
gagacaagcc 
aagccaacac 
cagctcatgg 
gaagtgctgt 
tcccaggagt 
ctggccagag 
agacagcaga 
ttgagccgtg 
ttaggaggtg 
tgatcagaat 



ggc, 



tgg 



aaagtgtcca 
tcctccaggc 
actgacaatg 
ctggcaacta 
caaacggaaa 
ggctcaagag 
tagaagttac 
gagactgtgg 
aagaatgtat 
tgaaatgcat 
gaaacacaat 
aaagcagtca 
gatttcttat 



taaggagaaa 
tctaccctaa 
gaaattttga 
attaggctct 
atgtaggtgt 
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150601 ataaaagaca acaacattca ggggaggaca gagggaccta tatgatggca gggtctcaac 

150661 attcaacttg aagtggtaaa atattgattt gaaaaagact ctgaagagtt acataggttt 

150721 attataatcc ctggagcaac cagtaaacac tatacaaaga tatacaacca aacacaatag 

150781 gtaaattaaa attgaatacc aaaacatgct caaatagaaa aagcaaaata agcccaaagc 

150841 aagtagaagg gaggaaatga taagagtaaa aatcaatgaa actgaaaata tgaaaataat 

150901 agagaagatc aatgaaacca aatttggttc tttgtaaaga tcagtcaact gataaatctc 

150961 tagtgagact gaagaagaga aacagaaaaa aatatgaatt accagtatca gaatattttt 

151021 aaaaaggata tcactaatga catcacagcc ataaaaaaga taatggaata gtataaacaa 

151081 ttctatgcat ataaatccat cagtgttgaa gaaatgcatg atttcctcaa aaactataag 

151141 ctaccaaaac ttacccaaga caaagtagat agcataaaaa gttctgtaac tattaaagaa 

151201 aaagaattca tagtttaaaa tcctgcaaaa aagaaatttc caggtctaga cggttttact 

151261 ggaaaattct actaacctct caggaaaaaa tggcaccaat tctcccattc attttatgag 

151321 gtcaacatta ccctgatatt aaaaccagac aaagacagtt caataaaaaa aaaaacccta 

151381 caaaccaata tccctcatga acctataaac caatatccct caatgaacat agacacaaaa 

151441 atcctcagca aaacactggc aagtctagtc cagccatgta taaacagcag agtgcacaac 

151501 agacaagtca tgtttgtcct ggatttgcag agtctggcac aatatccaaa agtcagtgta 

151561 acccagcata ttaacagact gaagaaggaa aaaaaatatg cttatatcaa gtgatagaga 

151621 aaaccatatg atatgattca ttcatgacta aaaattaaaa actctcaact aacagagaaa 

151681 gaaacgtgct caacctggta agaatatcta caaaaaaact aaagctaaca caataccgca 

151741 tcctgagaaa ctgttttccc ctaaaataag gagcattttc acgactccta ctcaaccttg 

151801 ggttgagtcc tagacagcac ctctgtacac acccagcacc cagcacagac cctcaggcca 

1518 61 gggcaggaaa gggccccaga gagcttcccc aaaacccgtg ttcctgggga aggtggatgt 

151921 ccaggaaggg tgactggccc ctaaagccta ggggaagcgg gggccctcct gctccagcct 

151981 catgaagctt tgatctccag tgccaagccc ggtgccagca gagagccagg cagatgaact 

152041 caaccaggga aaggacaccc aggaaggaag caactccatg agatcgagtc tcatagcacc 

152101 aagtgctccc acaggctctg aggacatggc tcttattcca ggaccaggcc tgacaggagc 

152161 acagcctgag ggatgagagg gcagggcagt cttctgctgg gtggggaatt ggccaccagg 

152221 gccagtccca catctgttgt ggccctggca ggatcaaccc tgggcctgag tccttggagg 

152281 tgaggccagc acgtagtaga ggggcagccg agtaagaaga atctctcggg gaagtcagtg 

152341 gccagccagc ctaagcccgg cccagaccct cgccctgcta gcatgagaca gacacagggg 

152401 gcccgcactg cagagctgcc ccggcccacg cgcccctgcc ctgctgtgct ccgccatctg 

152461 ccccagcagt gctgtgagag catcgcaccc tgcaagcctc agccttcagg gacggcaaga 

152521 cctggggccc acaggaggtc caccggccct ctcagggctc gagcacccgg ccaaccctgg 

152581 gtggccactg ctcagcactc caggacagct ccatgacaca gaagcagctc tgcccatgag 

152641 cattcaacag cccggtccgg ggggcaccac cacgcagggg cttcccgggt catgctgcac 

152701 catccactaa tagcaccatc acgtatgcca ggctgggagc ccgggcgtcc caagcctgtg 

152761 acctcattct ctatgctcct cacggctgcc ccacagaatc tctgaaacac gactcagacc 

152821 tgtcactgcc tgcccaaagc cctccagcgg cttcccacca catgactgcc ccggtccccg 

152881 ctgatttctc cttcactcaa gtgccagccg tggagatctc ctctgtgtcc ttggcacact 

152941 tcaactcagc acctgcctca gggcctttgc acctgctgct gcctccacct ggaatggcct 

153001 tcttccagaa gtccaggtcc tagctccgat gcctcctcac aaggaccttg gtgaagggtg 

153061 tcccttctcc atcacactcc acactgccat cctgtcctat ttttttgtaa caccatttcc 

153121 caatcatcga cacgtgggtt ctttattggg tcactctctg ccccctgacc agggatctta 

153181 cgagagcaaa gttgtccttg ctgggtccac gtcaccagac cagtgcctgg catctcacag 

153241 gcatggagtt gagtacggga ggtcgccggc cctcccgagg ctcccttccc ctacttcggg 

153301 atggtgtaag gtttccttgc caaggtccct gtggtgcaga gtgggccaag tttgccgcct 

153361 tctggagcga caacaaggag aaaacagggg tttaagtgga ggatccggtg ctctgatttc 

153421 tccccagtta gtaaagcatt caggctgctc ggtggaggtg ggaagacagg cgctgagagt 

153481 tgtctgacac aattgtctta gcagcaggag acgctgatgg gatgtcggga gtcaccagga 

153541 ccatctgaca aatgaccgcc ttggggatcc gtggagctca gcggtgccca gagagagcac 

153601 tcgaaatgcc agctggaaag gtcacgcacg ccagctggaa ggggagccgc gcagcccctg 

153661 ggcaggctga ggacaggaac gggcagccgc cactgttcac gagcaagggt gcaggggcac 

153721 cagagcccac aggagctcct gcaatgcgat gaccatgcct gcaggccccc ataatagaca 

153781 gcctcatttt ttttttctag gcaaaacttt gggaggatcc tttttaattg tatttgcaaa 

153841 acagtttctc ccaacatgat gcccatgtgt atgtggattg ggggaaattc ccaaggcctc 

153901 tcagagaagt gcctccttcc taaggtggtg tcccagcaga tgaatctggc tggcgtcctg 

153961 agtggaagca gggcgcgggc tgccatggaa ccccaccctc ctgcaggttc ctggcccctc 

154021 tgcctcctct tccttcccac cccaggggcc ctgggatgca gctaggccag ctgggagcca 

154081 gacagccact gaagccctcg gggtactgtt ccctgccttt agcatccccc tcagctgccc 

154141 attcattcag ccattcttct aggaggacag gccaggccat ggtgaggccc tgggtccagg 

154201 gaaagatacg aagagggctg aggcgggcgt tcctgccctt gggtctcttc cgctgccctg 

154261 gcagccccgt gcctcctgac tgtggactgg agttctgcag ctgcgccatc cacaggaagc 

154321 tacatggaga gatgggccag ctcagcagag aggcgcaaac accaattcag ttgaaaactc 

154381 tgtgttcccc gggtagaggc cagttaggag ccagcatgcc tcccaagggt tgtgcaggga 
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154441 
154501 
154561 
154621 
154681 
154741 
154801 
154861 
154921 
154981 
155041 
155101 
155161 
155221 
155281 
155341 
155401 
155461 
155521 
155581 
155641 
155701 
155761 
155821 
155881 
155941 
156001 
156061 
156121 
156181 
156241 
156301 
156361 
156421 
156481 
156541 
156601 
156661 
156721 
156781 
156841 
156901 
156961 
157021 
157081 
157141 
157201 
157261 
157321 
157381 
157441 
157501 
157561 
157621 
157681 
157741 
157801 
157861 
157921 
157981 
158041 
158101 
158161 
158221 



gacttcagcc 
acaggcctgg 
cacaggatgt 
aacagtgacc 
aagcctgaag 
actttgggag 
acatggtgaa 
caccggtaac 
ggagttgcag 
cagtctccaa 
gtgggtcacc 
agggcaaaac 
cactgcatgt 
gtcagggagg 
tcctgtggat 
aggagagacc 
gcgctcaggc 
cccaccaggc 
gcagaaacac 
ccagctccaa 
ctgccttgca 
atctcaggat 
tctgggccac 
ggctatgcaa 
gaatccaggc 
ggtctgatga 
gtgggtgtag 
tcctggggac 
ggcatggccc 
aacccacttg 

tgactcctat 
gatgcgtatg 
aaggccacgt 
cctctggctg 



aggcacccac 
ggccattggt 
ccccccaaag 
cagacctgac 
cctggtccct 
gccaaggtgg 
accctgtctg 
cccagctact 
tgagccgaga 



tggggcagac 
ggatagagct 
ccatcccagt 
cttcaactct 
ccagacactg 
ggcgggtcac 
tactaaaaat 
caggaggcta 
tcatgccatt 



ccagtcccac 
ggagaaaggc 
cagatgtctg 
aactgtgcac 
cagaaagctg 
gaggtcagga 



atgggccagg 
cagcaaagcc 
cttgaccaaa 
cagtttatca 
tgatatctgg 
ctgtcttcag 
ctcttagtaa 
gcccgtgctg 
gcggttcctg 
gtatcattga 
gacagcagtg 



taga. 



:tct 



gaccatggca 
ggaacccata 
ggatcctgag 

tggggccttt 
ccgatcagag 
tgccttcttg 
tgagaaataa 
ctgcatggac 
agcaagatga 
atgtcggccc 
ggacctggaa 
ccgccacctg 
caattctaat 
tacaatgggg 
gctgggaggg 



tctccgtgcc 
accctgggtt 
ccctcctgtc 
ccttgggagt 
ctgcccaggt 
accccggact 
gctgagatgc 
tgcagctccc 
agtgaagaaa 
gctggaggat 
gggtacagta 
ttccaggaac 
ccaccggtgc 
aatgtggcct 
cccatccaga 
cattgaagtc 
aggccctact 
agtqaaaggg 
aggacagcag 
ccggctgggt 

ctcaacaaca 
ttcagcagat 
ccccatccct 
gccacttctc 
ctgctctctc 
gtcagacagt 
tcaatgacaa 
ctttagtcca 
agaactctgc 
tcaggctcgt 
ctagaatcat 
ttccccatcc 
tattcggagt 
tgtctatctc 
atcatttttc 
gggaggaggg 
gccccaggct 
gtggtcagct 
aatgttacct 
atggagtgac 
cccaaattcc 
gggaggtgat 
ggaccccaga 
aaccagggag 
gcttccgatg 
taagacaggc 
gccaggcaga 
tgaagcccag 
gagcccagga 
cgttttgccc 
gtttcaaacc 
aactctgata 
gcagcagtgt 



tcagtttctt 
tgcctcaggg 

agcctgcggc 
tgatgccttg 
acgcctgccc 
tctgctgtct 
agcgggcagg 
gacagagaac 
caggaggcat 
tggcctagag 
aaggagaggt 
tagaatcccg 
ggagcagcct 
ccagctgaac 
tgggacacac 
gccctgccct 

ctcagcaggg 

ccagcccccc 
acaggaaacc 
gttcactctg 
ctcccccagg 
tggaagggcc 
tagcccatgg 
ggtgagcagc 
caaaggacct 
gctcctgaac 
taagtcacga 

gttaggtgag 
cctggcccct 
tgagcccagt 
catggtcctg 



gaaaaatggc 
tccatggcaa 
tgctggattt 
gatgttgaaa 
tatgtcacca 
gagctccctc 
cagaccctca 
cttttaagcc 
cctaaatgca 
ggagaaggcc 
agcgaagagg 
cgcatcctcc 
tgttgaaact 
agcaagcttg 
tggactaaaa 
tgtcagtgag 



aggcaggaga 
gtactccagc 
aaaaaggctc 
ctatttaaaa 
cagggcagga 
cacccctgcc 
cctggggaca 

agggctccct 
gctgtggggg 
gatctaggca 
tggtcttgtt 
ctacaagcct 
agccccagag 

gctggggtga 
gggggccccc 

ggggacgcag 
agaccatggc 
tgcccaaccc 
tccgcccctt 

aggccgagcc 
gcctggcgct 
tcgtccagct 
gcaggcgccg 
atgccaagaa 
gaacagtgga 
cttctccttc 
atgccccatc 
acccctctgg 
tttagctaga 
accctgctcc 
ctctctcccc 

tgggaaccct 
gctgcagagg 
ccccgaagac 
aaaagaactt 
tccgggtggg 
gctactcccc 
gattccatcc 
accctttcag 
ccacaaccat 
gctccgttta 
atcacacttc 
acgtacaggc 

cctagaggct 
gatttcagcc 
tagtaatttg 
gtggtgagag 



tgcttctgcc 

agcttcctag 
gggctgactg 
aatagagacc 
tagccacgca 
atcgcttgaa 
ctgggcaaca 
tggccctgcc 
tgcatagggg 
ccctggaccc 
ctctggcttc 
gacaagggtg 
cctgtcaccc 
gcacgtccca 
cctgtggacg 
tgccttgctc 
gtctttgggg 
ttccgaagcc 
ggctcccctg 
agaagaatct 
gggatggtgg 
gggagcttgt 

cctcccaggg 
aggcatcagg 

cacctgcccc 
tccaccccag 
ccaccgagtg 

cctggcccag 
ctgtcctgtg 
cctggctcag 
tcacagggac 
cactgacagg 
gcccacttcc 
tctccatatc 
tgatgtctgg 
atccccgacc 
ttggctatca 
gactgcaaga 
gccttaccgc 
gctgaaggga 
gtgccgctaa 
tgcaaatata 
acgtccgggt 
tgtagatgcg 
cccagggata 
caaggtgata 
acacgaatgg 
catgtgagga 
atgcacagct 
tggtatttgg 
tctttatcag 
agagcagagg 
agcactgctg 
ccggagggag 
tccagaaccg 
caggcagcct 
aattatggta 
ctcttccagg 



ttcagtggcc 
ggacccaggt 
ctcccctggc 

taatcccagc 
atcgtggcta 
tggtggcacg 
cctgggaggc 
cagcaagact 
acttgaccgg 
aggagctctg 
agcaggcccc 
tggggttcaa 
cccctgaatg 
aggaccctcc 
tgtccccaca 
gccctcctcc 
ataccacaag 
gctcagcccc 
cccccatagg 
gtcattaaat 
gattggcgtc 
tgcagagcct 
tagagatgct 
gttcccaggt 
tccctcaaag 
gccttagtcc 
gggtcaggga 
tgcctgcccc 
gttggagcca 
cagggcccgt 
ataaggggag 
tctgaacagg 
gtggagctgg 
cagggtcatg 
aggcacagca 
atccacctgc 
gactctcctg 
ttgtaagatc 
tgatcaccct 
tcaccccagc 
cctgatgttg 



gcccactcta 
gctttaataa 
aggtagaaat 
gggcatactc 
acagtcagac 
cccaatccct 
gtgagtccaa 
gacagaatgt 
tattaggaag 
gatttgtgcc 
tgcaacaaga 
tccagaaccg 
ctatagcagc 
aggaaggcag 
gagatggagg 
gctgccacga 
cgcggccctg 
tgagaggata 
cagaaaccaa 
gaagaatggg 
gcaggaagtc 
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158281 
158341 
158401 
158461 
158521 
158581 
158641 
158701 
158761 
158821 
158881 
158941 
159001 
159061 
159121 
159181 
159241 
159301 
159361 
159421 
159481 
159541 
159601 
159661 
159721 
159781 
159841 
159901 
159961 
160021 
160081 
160141 
160201 
160261 
160321 
160381 
160441 
160501 
160561 
160621 
160681 
160741 
160801 
160861 
160921 
160981 
161041 
161101 
161161 
161221 
161281 
161341 
161401 
161461 
161521 
161581 
161641 
161701 
161761 
161821 
161881 
161941 
162001 
162061 



aatggacaac 
tcatggcaac 
gtggcctgtg 
agaggagggc 
atgcacacac 
ggacgcacac 
ggccagagca 
actcacaccg 
acgcattcac 
caggggtgtc 
cgagacagag 
acactcgtgc 
tcagcagagg 
aggagggcct 
tgatggtggg 
gcttcgggct 
ccggccaggg 
ccaccgggtc 
tctggggctt 
cgccagctct 
tccccacccc 
acaagagtag 
agcgctcctg 
ctggactgtc 
ccacagggtg 
ccaagccctt 
acctctactc 
ccaggcccaa 
tcaagcccag 
agagcctaat 
tggaagtggt 
cctcctcctc 
tcctgggtcc 
tcctcccaag 
agcactgagg 
cactgaggcc 
gtcgggatcc 
tttgtttccc 
gaagccatct 
agctagaagc 
acgagctggg 
ccatggcacg 
atggccacag 
gcttggtggc 
ctgggggatc 
gggagaaaag 
aatatttcaa 
tatcaataca 
catcaaagac 
taaacaggga 
cagacgggag 
gagggaagag 
ctcagagggg 
agggtgacac 
cggccacagc 
gggctcccca 
tgctgtgtcc 
cagtccagca 
gacgggacgc 
agtccagaag 
aatttaaaaa 
tgaggcagga 
ctgccttcta 
ttttaaaaag 



atcgaaaccc 
cccagaacca 
aagaggggac 
ctggccagag 
tcacaccgag 
gcattcacac 
ggggtgtcag 
agacagagga 
actcgtgcac 
agcagagggg 
gagggcctgg 
acacacacac 
ggacgcacac 
ggccggagca 
gcggggggtt 
ctgcatccac 
ccagggccag 
ggccctgtcc 
ggctgggcct 
gagctaagtg 
gtggcgtggc 
taaatgcacc 
cctctgcagc 
tcccttttac 
acgggccaga 
gtagccttcc 
cagcagacct 
ccaagagctg 
ggcctaccta 
cacagccagc 
actggattcc 
cccctgcccc 
ctgcctcaga 

ccacagttgg 
acagttggct 
cagggctgac 
accattggga 
ctggggactt 
tggcatgagc 
atctgagtgg 
cagaggaaaa 
gaactggccg 
tgcaccgaag 
tggacagagc 
ctgcgagacg 



gatacatcaa 
gccagcagcc 
gcacacgcat 
caggggtgtc 
acagaggagg 

cagaggggac 
gggcctggcc 
acacacactc 
acgcacacgc 
ccggagcagg 
tcacaccgag 
gcattcacac 
ggggtgtcag 



tatataagaa 
aaagagaaag 
ggcagcaggc 
acaagagccc 
gctgccccat 
tcggcaggga 
cctgggagat 
tcagggggtc 
gccccacaga 
ctccaggaac 
aggcccaggg 
agtggtgtac 
ttcaagatca 
attagccagg 
ggatcccatg 
gcccgggaga 
aatacacagt 



acggaggctt 
gactggcatt 
ttgtcttgct 
cagcctcctg 
agcgcagagg 
gtggcgtggc 
gtgctgtact 
cccccgggcc 
agatgaggtg 
attacaagcc 
tgcacccagg 

caagggacat 
gaagtaactc 

tacacccacc 
tgcatggagc 
ccctcccacc 
acgtcgccat 
cttctggaaa 
tctggaaagc 
tgcactcctg 
gccaggatcc 
tcctcccatc 
cctcctagca 
cgcaggtact 
ctgaaacaac 
cccccaggag 
agcagaacag 
tctcgctgtg 
tgggcaagtc 
tacatgaaag 

ttcaaaagct 
agacatccaa 
aggaagtcct 
tccgaggcca 
cagaggccac 
gctgagagca 
attgcatgtg 
acaccctccc 
ctgtgagctc 
cagagcaggc 
acctataatt 
gcctgggcaa 
catggtggtg 
aaccaggagg 
gagcaagacc 
ggtgagaaag 



taaagcctta 
tcacactcat 
agcagagggg 
gcctggccag 
acacactcac 
gcacacgcat 
ggagcagggg 
acaccgagac 

ggtgtcagca 
acagaggagg 
tcgtgcacac 

cccacacccc 
cggattgggc 
tcccgggctc 
cagctcgggg 
gctgccacgt 
agagtgcagg 
cggaatactc 
cagagcccag 
gtgtctctct 
agacagcatg 
cctgattctg 



gga< 



agati 



gcactgaggc 
actgaggcca 
ccagtgaagt 
caggtcagga 
tcccttgtcc 
ccaccagcca 
atgttctaga 
gatccttcca 
ctgctgagca 
aggaactgaa 

ccagaagaga 

agctagaccc 
cccagaggga 

gctggcaaag 
gagccaccgc 
accagagcag 
gcagggccct 
ggcatcccag 
cactcccagg 
caagagaaca 
acggagaact 
ccagcacttt 
catagtgaga 
cacacctgta 
tcaaggttgc 
ctgtctcaaa 
gatggaaaga 



tagaatagca 
ccggggctcc 
gcacacacac 
acgcacacgc 
agcaggggtg 
atcgagacag 
tcacactcat 
tgtccgcaga 
agaggagggc 
atgcacacac 
gaggggacgc 
gcctggccgg 
acacactcac 
aggagttccc 
tgcctatgcc 
gcagcctggg 
ctggtcccat 
gaagcacctc 
ctcactggga 
gagcggcaag 
agttttccct 
gaagcagccc 
ctcacgggcc 
cagaggccca 
ggggcttcca 



aaataccctg 
cacagttggc 
cagtgctttg 
gggggtgtcc 
aacagagctt 
cgctcacgag 
cagtctccac 
gaggccgtac 
gagacaaagc 
caaacgccac 
cagccacagc 
ggagcaagag 
gtcggggagg 
ggatgggttc 

aagagcagat 
ggggcgtggg 
aattcaaagg 
agcagaggac 
cagagatcta 
caggagcagg 
gctcctgcct 
gatggcattt 
gaaggctgcc 
gctgtcgaac 
gggaggccaa 



ctttcagcta 
agggagctac 
aaaaggaaat 
agagaggaaa 



tttcaggact 
ctcctcagga 
acaccgagac 
attcacactc 
tcagcagagg 
aggagggcct 
gcacacacac 
ggggacgcac 
ctggccggag 

acacgcattc 
agcaggggtg 
accgagacag 
aggcagaacc 
cagacctcca 
agcacacaga 
cctgcccctc 
tcgccagcca 
gctgggcggc 
gccaggctcc 
gctttttgag 
tcccacccac 
cacatcctca 
tgtgcttgtc 
cgctcttcca 
agtgccccca 
cttttttgtg 
cccgggagcc 
ctcccctccc 
ggaaggaacc 
cccaccccgg 

cccagagggc 
gcttctggaa 
ttctggaaag 
ggaaggtcat 
ccaaaggggg 
gggtgggagg 
gagaagccag 
ctccggggac 
ccaggaccgc 
acagccccag 
tggagagagg 
caccgcagaa 
acagagggca 
tgagagaagg 
ctagaggcct 
aacctaagaa 
cagatcagtg 
caccgggcag 
ctcttgcctg 
agctgtgtcc 
gcaacccaga 
gctggggagt 
aggcccccta 
gagaccttcc 
tcttcaaccc 
atgaagtggg 
gaaccgctta 

ctttggaggc 
gattgcacca 
atatttaatt 
ggaaggaaag 
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162121 


ta aa 


qaacaaaaga 


aqqaaaqaaa 


gaaattccaq 


ctaattctca 


gcaacagaca 


162181 


attacaqqaa 


cacaaqqaca 


aaqcccaqaq 


ccccaacccc 


acgtggcctg 


ggtccccaca 


162241 


ctccaacc 


ctcctqtcac 


tctqccaqct 


qqcaqcaqgq 




gcatcctttt 


162301 


cctca act 


ttqgcqtctt 


tcctcccaqt 


qqtttcttqq 


tagttcctca 




162361 


taaatcta 


qctacaqcta 


tccttqtccc 


tgaaqatqqa 




agagtcqctg 


162421 


atatqqtaac 


acaatatcac 


acatctcatc 


caaqcctqct 


cggacgctgc 


ccttctqacc 


162481 


c attcc 


tcaccaacac 


aacaqqaaaq 


qaqcttqaaq 




ccacaqcaca 


162541 


qttcctcqtq 




qatqcqaqca 


ggtgggcgga 


gqqtqcactc 


atqaqacttc 


162601 


tataaacaca 


aacccamqq 


ccatcccaqc 




qqqqactcct 


gcacctcttc 


162661 


ctactaccca 


aaccatqqat 
g yyg 


qqccaqcaqc 


actqcaqtqt 


tqqtqtaatc 


tatqtccccc 


162721 


at cca 
gga ggccag 






ct a ac 
gC ^ac^ta 


catqcccca 


cacaaac cc 


162781 


c caca g^g 


cat^catac 
g 


atcacccaca 


actccct; 5 c 


tacatttcaq 


cacqcqtcta 


162841 






acca ct a 




ca actcatc 


cca tt 


162901 




caqacqtaca 




taqaqqtqqq 


qctqqtccac 


aqtqaqqqaq 


162961 


t 


qtaqactqqq 


ccaqqqaaaq 


qttrtqqaaa 


cccaaactqa 


aqqttccctg 


163021 


oootccctaa 


qqctcatcca 


qqqct qqqqt 


qqaqccaqqa 


catqaqqaac 


qqgcaqtqat 


163081 


qaatctqaaq 


ccctqctqqt 


caqcaqaqcc 


tccacctgca 


catcctcccc 


aaaqqtcaqq 


163141 


caqqcttcac 


cct ccac 


tctctaccct 


qqccattaqc 


aaaqaqqccc 


cccttqqacc 


163201 


aaaaaaaaca 


caaacca 


cctcct 


ccttcccatc 


caaqqctcca 


qccactctca 


163261 


tqqaqactct 


a ct a ac 


actcccaqca 


qaaaqgqtca 


tttatctcaq 


aqcttaacta 




qccccttcca 


atta at aaaa 


aataaaataa 


caqcatttac 


tctaaaqaac 


tqcqatqcqa 


163381 


qaqtattaat 


taaatacqct 


qccqctqaqc 
y y y y 


ctcaqcacac 


aqqaqqctqa 

gg gg g 


qccctcatta 


63441 


aaaaggca g 


agac aaagc 


aagacaggga 


ggagaggcgg 


ggccagc ag 


a a at 
acagggga g 




gggagagaag 


gaga gccgg 


gcggggcggg 


gacagccagg 


gacggccaca 


gc gccc gc 


163561 


caggagggac 




gcaaagtggg 






^c^tcc; 3 ? 9 


163621 


ctggtgggga 


ct^ac^tt 


cacacgcaga 


t t 
ggg gc ccc 


t 

gcacacccc 


cgcc ccg c 


163681 


tttttggaat 


c gacag 


agaagacgac 


a ctccttctc 


g ccag ggc 


ggca g 


163741 


cccca gaga 


ccacagagag 


gcagaggaca 


^ a C rr trr 


agaccagggc 


tcccct"° a 


1 63801 






cagga -ggag 






gggg 


163861 


aaacact a 


t tc ct tt 


aa^tca ac 


gacgutgggg 


cct cctct 
cc ggcc c 


gagaggcggg 


163921 




tcttcca 
g c ccagg 


aagg cagac 


cccagcccaa 


at 9 cactct 9 


cctgggccac 


163981 


aaagagagc 


ggtccctggg 


ccagaaagcc 




ct^tcttcct 


ggcaaggggc 


164041 


gtgtgtgcca 




age, gcacgc 


tatttctctc 






164101 




ccct^tat 
gccc g a 


gac tgagggt 


cccaacagc 


tattccctca 


acacacaqqc 


164161 






a^caccac 0 




a cat cat 
cac^ca^c 9 


cacccacata 


164221 


caaa^tcca 
caaa g cca 


ta^catcac 
c^at^caca 


a gcaccacg 


t t cactca 
g gcac ca 


cacgcaggcg 


tcacaca c 
acacat^cat 


164281 














164341 


cacatacaca 


t^tccacaca 


t t cactac 


acacaa'tac 


acacat cac 


ca aaaaat 


164401 


ccccatgcat 


gy 




acacatacac 


t t 


tcatacat 


164461 


tgtatcatat 


gcagg a ca 


t t t 
a g acagg 


ca gcag gc 


t 


t 


164521 




aca gcacac 






gca gacac 


cat cactca 


164581 


cacatqcctc 




acat^catca 


t 

c gacg gc 




ccacacacat 


164641 


°cactcactc 


acaccctcct 


gctgctccca 






actttccca 


164701 


tt cc c 


tacctqaqaa 


ggtgtttatt 


^ ccttt t 


cttattqtat 


qtctctacca 


164761 


cctcatccta 


cccctccca 


ccctccccat 


ccacactcaa 


cca ta aat 


a aa cctct 


164821 


cccctct tt 






tcttttcacc 


caqaaaccac 


aqqaaqcata 


164881 


c t ca act 


qactcaqcc 


ttcacca cc 




ttac cct 


ctccccaqca 


164941 


cccaaaccca 


cqtccccact 


qatcaacaca 


ttcacaccca 


qcctqccaca 


tctccqqcct 


165001 


ggcccacaga 


at cct tc 


cca actcct 
ccagac cc 




act ccct 
gac gccc g 


cct atctca 


165061 




ctct cattc 




i^ct ccca 3 




ccc a ca 


165121 


ccaatqatca 


qtcaacctcc 


ccagccttct 


qtttqccttq 


caqaqccaqc 


actcccaq 


165181 


gagagctgca 


qaqqctqqtc 


aqcaqcctca 


ctqtccctqc 


cctqcctqqt 


qaqqqaaqqq 


165241 




ggaaattccc 


tcccgctcca 




gtgtccttct 


ctctcctgct 


165301 


gtgctcctgg 


gcagcagctc 


cctcctccag 


ccttccaggt 


ctcccaatcc 


tggccccagc 


165361 


cacttccttg 


ggaaccctaa 


gcctgcctgg 


agtccaccct 


cctccatttt 


tttgggtttt 


165421 


ttttgagacg 


gagtctcgct 


ctgtcactca 


ggctggagtg 


cagtggcacg 


atcttgactc 


165481 


actgcgagct 


ccacctcctg 


ggttcacacc 


attttcctgc 


ctcagcctcc 


tgagtagctg 


165541 


agactacagg 


tgtccaccac 


tacgcctggc 


taattttttt 


tttttttttt 


tgtattttta 


165601 


gtagagacga 


ggtttcacca 


ttttagccag 


gatggtctca 


atctcctgac 


cttgtgatcc 


165661 


gtccacctca 


gcctcccaaa 


gtgctgggat 


tacaggcatg 


agccactgtt 


tctggcccac 


165721 


ccttctcctt 


ctgatagcac 


agcccctgtg 


tcactcagga 


accagctctc 


atccctgcgt 


165781 


caccagtggg 


caaggagggc 


acatgaccca 


gcctaggcca 


ggtagaatac 


gttgtcatct 


165841 


gactacatga 


acatagctca 


gggatgagga 


cttgccacaa 


ggtattccat 




165901 


ccaggaacta 


ttgctggaat 


tgttggaaag 


gaaaagctgt 


ttcccccggg 


ttgttgccct 
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165961 
166021 
166081 
166141 
166201 
166261 
166321 
166381 
166441 
166501 
166561 
166621 
166681 
166741 
166801 
166861 
166921 
166981 
167041 
167101 
167161 
167221 
167281 
167341 
167401 
167461 
167521 
167581 
167641 
167701 
167761 
167821 
167881 
167941 
168001 
168061 
168121 
168181 
168241 
168301 
168361 
168421 
168481 
168541 
168601 
168661 
168721 
168781 
168841 
168901 
168961 
169021 
169081 
169141 
169201 
169261 
169321 
169381 
169441 
169501 
169561 
169621 
169681 
169741 



gtgaggaggt 
tgagagtgaa 
caactgaaaa 
tatttttctt 
acttggattt 
tcaaactctc 
aatcctctgt 
gcactcacat 
tgccatgcgc 
tctagcttcc 
tgccccctgc 
aggtgggcag 
gggccaagga 
ccccaggatg 
gaccttgccc 
ccccagtgac 
caaaagctgg 
gtgcctggag 
gatgttccgg 
aggcagatca 
atgtgcatgg 
ccattttctc 
ttttaactta 
gtgctgggaa 
cacccacggc 
ctgaaagtgc 
cagttctccc 
gacaaaccct 
atcagacacc 
cgccagccac 

aggccaggcg 
tgggctgcgg 
caagctggcg 
ctcctgggag 
gggtgcgtga 
gcctaggact 
tcacctgtct 
gaggcaggca 
atccgtggtc 
gaaatggcct 
gggcccccca 
aggagaaaga 
aacagccgcc 
gccgttggga 
ataaactggg 

agggcagaaa 
gtgtttacac 
tatttaatgc 
tattcacaag 
aaactgtggt 
atgcatgctg 
caaagggcaa 
atacaggcag 
tactaaaggg 
gttgcaccat 
aaatacatag 
ataatgatta 
ttttttttga 
gctcactgca 
gttggaacta 
gggtttcacc 
ttggcctccc 



aatgcagcag 
atcgacatag 
tttatttggg 
tttgcagatt 
ccaacacttg 
catctcccgg 
gctcgaggca 
ctaagactgc 
accagcttcc 

att 



tggaatccac 

aaaccctctc 
ggagtgcttt 
atcccggcct 
cagaacactg 
caaagggaag 
acgtgggaga 
cttttggctt 
aagacaggac 
tgccaaaatc 
ccgtgccttc 
agagaagtga 
tcccccaccc 
gagccacgct 
gggagacgcg 
agagtcaggg 
gggagggcac 
ggacagaaac 
ctccctcact 
ccagacccct 
tggacgctct 
ggactggatc 
gaggccactc 
catggctggc 
ccgagccagc 
ccccttctga 
ggcagaagga 
ccctccagct 
agcagacacc 
agcttgatct 
cctgtcactg 
aaaagagaac 
aactctggag 
tcacaggctt 
tcagcacaac 
atcaattaaa 
agctaaaggg 
acatccatgc 
caacctagat 
atattgtgtg 
aaggtaggtc 
tctggggttt 
tctgtgaaca 
tatgtgaatt 
aaaattttaa 
gacagagtct 



ctgcctgcat 
aggagagcaa 
cacctggatc 
aaccaataag 
aggaataaca 
atcctcatcc 
gggccacaca 
acaccctccg 
ctccactccc 
cagcgaccct 
ccctgaaccc 

t ccccttctgc 



gacctgcatt 

tcctcagcct 
ctccatgttt 
gatgatccag 
gtacaatacg 
cacacaaaaa 
atttctgcat 
aaaaagttgc 
tgatgggacc 
tttgaatcag 
aataacccgc 
aggtccgcac 
gggtcctccc 
aqqaggccgt 
aagcctctgg 
taqacgaggg 
ccaaatgtgg 
cggggcctga 
aactgcaggt 
cctcctgagc 
ctgcctgcag 
cagagccaag 
aggggctgca 
cagtgtcagg 
caggcctttg 
aagacaatgg 
gtaggactcc 
aggcttcgtg 
ggacacctgg 
ccctgtggga 



cagatgcccg 
atgttgccca 
aaagtgctgg 



agaggcagca 
cggcaactgt 
ttaaattata 
agcacacaaa 
tggaaacaac 
aatggaatca 
gaaccttgaa 
attcatttat 
agtggttgcc 
cttttcagag 
tactaaagcc 
atgtctcaat 
aagcatgtgt 
cactctatag 
ccagggttca 
ccaccacgcc 
gggtggtctc 
gattataggc 



agccaagggt 
cagccacacc 
tttttttttc 
cctgactaat 
tcggtccctg 
cacatctaca 
cagctgatgt 
ttgcctctca 
gcaaagcccc 
cctcctcttg 
tgcctcctcc 
ctccctcttc 
acagagcgct 
tccagctgcc 

ctccctcagc 
cctgagtaga 
aaatatttcc 
acccttttgt 
tgttgaaagc 
tctgttttat 
cagccatcgt 
cagcgcaggt 
tcctcacggg 
ccatggtcac 

gcaggccctg 

ctgcctctgt 
cctcggcagc 
cgatgcaaac 
cggtaccagg 
gtccctaagg 
acggtgtgga 
acaggactgg 
actgacagtc 
caggaagtga 
gagcaggtct 
gctgccttcc 
ggccccaggg 
caagctaagg 
gcaggtgtca 
tcctgccaca 
gctccctgga 
catctttact 

cagcaccgct 
gaggctgtaa 
aaagttttac 
ccaatccatc 
tatctttcca 
aacattatgc 
actaatgaaa 
taggcctggg 
tcttgaacat 
attcattgaa 
aacattatta 
atgttgatag 
ccccaactgg 
aacaattctc 
cggctagttc 
aaactcctga 
gtgaggcacc 



gcctcagaga 
tggggatgtg 
tgaagctaga 
ttttttctga 
acactctctc 
agcaaactca 
caagcgcaca 
ctgccccacc 
ctggcccctc 
ctcttggctg 
aacctctctc 

ctttggtgcc 
cccaacccga 
agactccttc 

ccccaggccc 
ttatgctgcc 
attatgtgtg 
cattaagtat 
taatggcact 
atgaaggcac 
gagtcacccc 
ggggagagga 
gccatctcaa 
acaggaggga 
gatccacgcc 
aaatagagct 
cagatgactt 

tccgcaggca 
tggacagaaa 
gggagttaat 
agaggtgcct 
agcagggccg 
cgggaggcag 
tccacaaaac 
cagaggcctg 
gcacatctga 
cttctaaata 
gcagcagtcc 
acagagagaa 
gcatggctga 
ggccacagca 

aactgggaac 

gctgcagacc 
gatgcactcc 
atgaatgttc 
aacccatgag 
tccaaagtaa 
tgagtgaaag 
tgcccagaac 
aggacggggt 
gatctaaaat 
ttttacgtta 
tcaaaaatgt 
ctttattttt 
agtgcagtgg 
gtgcctcagc 
cttgtatttt 
gctcaggcca 
atgcctggcc 



agagcaggcc 

tagttctaga 
agccactttt 
ttctatagct 
agccacctaa 
cgtaaacact 
acagcatgtg 
acctggtcag 
ctgggcctga 
atcctttagc 
acatggtggg 
cgcccaccca 

caagagacag 
ctgtgcaagt 
acagacccaa 
tctattgaaa 
gcatgggcag 
gtttacacat 
gcagggattt 
atacgacagt 
aggactgccg 
gcagaggatg 
ggtggagact 
gtgcagggtc 
ctcccttgag 
tcattgccac 
gtcaaggctg 
ccaggactga 
ggtggggccc 
gaggcctctc 
ctctcacaca 
ggcaagtcct 
tgcccagcat 
aacttgagct 
caaggagccc 
gtgccctcga 
tggaaaatcc 
gctgcgaagg 
aggagggcct 
tactttgggg 
gggcaggcag 
cttgccaggg 
cctgggctcc 
tgaaagctgc 
tgcgcccgtc 
agaccaaatg 
taggtcttat 
ctggcagcat 
tggatttaaa 
tgaagctctg 
aagccagaca 
aggcaaatct 
aattgggggt 
tgtggtgatg 
taagttggta 
gtaaggtata 
ctcttttttt 
tgcgatctca 
ctcctgaata 
agtagaggca 
tcctcctgcc 
gatagcttta 
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169801 
169861 
169921 
169981 
170041 
170101 
170161 
170221 
170281 
170341 
170401 
170461 
170521 
170581 
170641 
170701 
170761 
170821 
170881 
170941 
171001 
171061 
171121 
171181 
171241 
171301 
171361 
171421 
171481 
171541 
171601 
171661 
171721 
171781 
171841 
171901 
171961 
172021 
172081 
172141 
172201 
172261 
172321 
172381 
172441 
172501 
172561 
172621 
172681 
172741 
172801 
172861 
172921 
172981 
173041 
173101 
173161 
173221 
173281 
173341 
173401 
173461 
173521 
173581 



tttcaatcag 
aggggtccat 
actaccagga 
cctatggggt 
ctgctctgct 
tcacaccccg 
aacttcagcc 
ggcagagttg 
ggatgaacgg 
gggccttgaa 
gagcatttcg 
agtgggagaa 
gggacccata 
cgacccactc 
acgaggcctg 
catgtgaccc 
gaacctgggc 
ggcggctgag 
ctttggcccc 
ctggggatgg 
aggacaggag 
aacggccagg 



cttcttgggg 
caccttgact 
cgtgtgtgtg 
gctgtattgg 
ggtgatgtca 
attccatttc 
gggcagggag 
atgcccgccc 
gtgtgtccaa 
cttgtagtaa 



aggagaggga 
gcagcactca 
tgaagcccgc 
ggacccttct 
agatttcctg 
ctggaaaagc 
aaactccgcc 
aaggcaggga 
accgaatcta 
aactcagggc 
tgcccggtgc 
agcctccttg 
tgggcacatc 
gccaccgtgg 

ggctagatgc 
gaaaacatgt 
gagaatcctc 
taaaaagtct 
aaacatccaa 
ctgtcgccca 
cccgggttca 

aggatggtct 
attgcaggcg 
tatctgtggt 
gaagatgcag 
gccttgagga 



atttatttcc 
gggcctcacc 
cagaagacag 
tcccagcatt 
aatgcaaagc 
acacacacta 
aagacaaatg 
acagttgatg 
gaaggtggac 
tgctggttct 
ggtggggcac 
acgggggcag 
gctcctcctg 
ccagagggtc 
gccagccgaa 
aacctgagcc 
ctctctctgc 
ctgaaaccgt 
agccgcttct 
cctgtgggag 
cgaggccagc 
cggccaagtc 
ggctgcctct 
cctgcccgcc 
aattgttttc 
tgtgtgtgtg 
cgggaagagc 
gcggctcctg 
agaacagtaa 
agcatcctac 
aagccccgcc 

caggcccttc 
gagtagttac 
cagggaggga 
agcgcacggg 
gagggagagg 
tctcaaaata 
cagggtaatg 
agagggttcc 
gttctcaaga 



ctgcacaatg 
ctcaaaacag 
ctcgccagcc 
atctttcaaa 

actgaaaggg 
ctgcgttcga 
ggaaatgggc 
cagcattcac 
tctgaaggca 
tgaaaaagaa 
acccccaaaa 
agatgcaaaa 
gcgagagatt 
ggctggagtg 
cgccattctc 

cgatttcctg 
tgagccaccg 
tcatatttgt 
gtcattggga 
cagcaatgct 



tttgtatttt 
aggctggcag 
gaaggtttgc 
caggcaaaag 
ccatgaggcc 
ggccctggtt 
gtcccctcct 
ggtgtgtgtg 
atggtgggta 
gggggcctgt 
ggaagggaga 
ggagaagaca 
acacatcagc 
agaggacaga 
gtcctgctcc 
agtgagaact 
cgggcggctg 
gcgggaggcc 
gccccggttt 
ggacaggggc 
cgcctctggg 
ctcccagctg 
ccgtcctgca 
acgagactct 
aggtgggaag 
tgtgtgtgtc 
tagaggaccc 
gcctcttcca 
cggcctccac 
cccggcctgt 
cacagagctc 
actgtggtcc 
gctgctcagt 
cgtcatcagc 
gggagcagaa 
accttgcagg 
gagagagagg 
agagaaactc 
atagggagca 
atccttgctc 
gaaactggaa 
tcatctgtgg 
ggaaccctga 
ggctcctcag 
ttctcctgaa 
gaccgcagtt 
aaccacaggg 
aaggtcaacc 
agctctactt 
ctgggagatc 
agaagaagca 
tggttgtagc 
gagacaatat 
acatccaaaa 
taaatagaga 
taggtttgtt 
cagtggtgca 
ctgcctcagc 
tttgtatttt 
acctcgtgat 
cacccggcaa 
gccattctag 
gcaaaggatc 
ggggcttggg 



gtagtgtaca 
gagggttcac 
tggaagaggg 
cccagaggtc 
caaggagtct 
ccttcaagtc 
gcacctgcca 
atctttcgga 
gagcaagcag 

ccgcgtgctt 
gggcagatgt 
ctggctttcc 
cgccagccct 
gctgtcctgg 
caccactggg 
gtgcactgcc 
tgggctgccg 
tctgtcactt 
acaggcacca 
ccacaccctc 
aggggtgcgg 
catcccctcg 
tctcccattc 
gatttggtgg 
cacccccagc 



tggct 



gacatggaac 
gtgatgagaa 
ctcagggacc 
cagcttcttc 
cttaggtcat 
caggcactgt 
gctcctgtcc 
gaaatgatga 
caaagtggag 
atcatgggca 
cctggagaca 
tctggggccc 
ggagccacag 
ccatagtgat 
ctctcccaga 
caggcaggac 
ccctgagccc 
gcagtcacag 
gctgcacagg 
cggattcact 
aggaagggat 
ccacactgac 
cagactggaa 
agaaatggcg 
cgggcagctg 
ccatgtcctg 



atct i 



ttctt 



gctggccctc 
ttttaccttt 
aaccaagccc 
ggctgatggc 
ggaggaagaa 
tcaggtgcct 
gcagagactc 
agctctagcc 
atcttgagct 
gtggacatgg 
agtgcagaaa 
tccacattca 
agtgtctcac 
ttcctgccag 

gccgccactc 
gatgcaagag 
cattcaggcc 
gcacctggtt 
tcccacacaa 
tatgaactgc 
aaaggaacca 
aactcatata 
ttgttttgtt 
agcgatctga 
ctcccgagta 
tagtagagac 

gagatttatt 
taacagagag 
tgaagcaggg 
cttgccaggt 



cgccatcagg 
gcttcattta 
cacagatgag 
acgtgttgga 
caaaaactga 
atgaccgttc 
acacagggag 
ccaaggcacc 
ctccagcaca 
ttgtgagaaa 
cccgtggccc 
attatctcat 
tagagtctcc 

agaccacgag 
atccccataa 
agcagggaaa 
aaggtttgtc 
ctcaattccc 
aattactaaa 
agaaagtaac 
aataataaca 
tctgggaacg 
ttttgagaca 
gctcactgca 
gctgggacta 
ggggtttcac 

tcagaaggac 
actgctttgc 
gctgttgcca 
caaccatggg 



tattttgatc 
agctgtgcta 
gttggccagg 
cccccactgt 
agccacccga 
tttcagattc 
ttccaacgtg 
caggcccaca 
tggtcaggag 
gagccatggg 
gctggaaggg 
gaaggcaatg 

cagtcctgtg 
ttgcctgaag 
aaccctcggg 
ccacggggag 
cgcattcccc 
gagtgtggac 
tgtggatggg 
cccaaaggag 
cagagcgctc 
ctccatttcc 
atcttttccg 
tttgtgtgtg 
ccccaggaat 
aggggctctc 
tgacccagga 
gtgatgcctg 
ccccactccc 
acctggctcc 
aaaatatgat 
atcctcatca 
aaacaggagg 
gcgagatcgc 
agtcatggga 
caaagcccat 
tgaatgcgcg 
tggaactgtg 
taggcccggc 
ttttcaaaca 
accagtgtgc 
tgtgaagggt 
aggttgtacc 
cccccgtaga 
accctgtgca 
ggactgagag 



ggaaagccgt 
cactttgggt 
ttagaagcgg 
gacataaaac 
tgcacacgta 
aggaagttca 
caaacatggg 
gagtctcgct 
ggctccgcct 
caggcgcgca 
cgtgttagcc 
aagtgctggg 
tttttgtgat 
taagacatgg 
ggaagggcgc 
tatcaagtgg 
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173641 


ttcqtqqcaq 


ctctqcacat 


qcatqtatqa 


173701 


tttctccctq 


qqaataaaqa 


tgtctaaatt 


173761 


qtcccaqqta 


ttctqaqtaa 


ataatcacct 


173821 


taacaatct 


t t t 


aa ggaaaa 


173881 


aqtctcttaq 


taataaacct 




173941 


ttccqtqaqa 


aaattaaaca 


tcatqaaqat 


174001 


aaaaccttct 


y agggga 


cactacttta 


174061 


actt cccta 


caga g ca 




174121 


atq aaa 






174181 


tttqqqaqqc 


caaqqaaqqc 


ggatcacqaq 


174241 


cqqtqaaacc 


ccqtctctac 


taaaaataca 


174301 


t ta tccca 


ctactctt 




174361 


cttqcaqt a 


q c 


^ccact 9 ^ 0 


174421 


tctcaaaaaa 


aaaaaaaaaa 


acaaaacaaa 


174481 


cctccat aa 


cttcaca at 
at ^^t^a 


t 

gg 


174541 








174601 


accaacacct 


t ta 
a cag a 


aagaaacaca 


174661 


aaaaatcaa 


caggaaaaa 


agcaaaaca 


174721 


cttttctaca 


ggctgagtaa 


ccagaagaaa 


174781 


agaaacaaaa 


caccagcccg 


gcagaa aaa 


174841 


aagc agcag 


3 taaccacca 


ac agcccca 


174901 


gttgggggcc 


g aaccacca 


geca aaag 


174961 


acgcctgtaa 


cccagcac 


ggg a gg cc 


175021 






gg gaaaccc 


175081 


gggcgcggtg 


g ggg g 




175141 


gtgaacccgg 


gaageggage 


tt t 
gcag gag 


175201 


ggcctgggcg 


aaca^att 9 


cct°actccc 


175261 






cc gac ccc 


175321 


cgqactcct 
cggac cc g 


caggecttgg 


ggagccaggt 


1 75381 


gcgagg a cue 


gagccccca 


gag c a ca 


17 54 41 


tacatctacc 




ca geccag 


175501 




tcacca^c 
caccacagc 




175561 


cctccct ca 


ac gccccgg 


^gagac^gee 


175621 


tetgeaaaga 






175681 


cacatttccc 


ctcccccaqt 


acccataatc 






actcccaqac 


aaaacaaaaa 


175801 


ccacctgcac 


aaccttccca 


gcagccctgc 


175861 


agcccgtgac 


agectgggea 


ccccaacagg 


175921 








175981 


tocaccaaaa 


actcaataac 


aqqccaaqqa 


176041 


tcaccctgct 


tqtacaqqtq 


aaaaattatc 


176101 


ggtgctccag 


t t 


t 






t t t 


tttcctccca 


176221 


aqaatqtacc 


aqqaaqqtac 

a gg 




176281 


qaaaqqtccc 
ggg 


aggag cc 


ggc ga gaa 


176341 


agaggacaga 


a gggg ca ggg 


a gggg a g a gg 


176401 


a^^^aa^a 






176461 


a g y gg g aag a 


gagaggaggg 


aaggggagga 


176521 




^^aa^^a^ 


^a^a^a^ 


176581 


aaaaaaaaao 


gggaagggag 


ggagaggagg 


176641 


aaaaaaaaaa 


^^aa^^a^ 


^a^a^a^ 


176701 


aaaaaaaaao 


gggaagggag 


ggagaggagg 


176761 


gacccagcac 






176821 


aactcaggcc 


ttctccctgt 


eggcaggget 


176881 


atcagagccc 


ccaggctggg 


tgaggtccac 


176941 




tgtggtttcc 


ctcacaccaa 


177001 


gtgccccaca 


gttgagtgca 


atgctcacgc 


177061 


ttaaggcetc 


agtcccacac 


agcagcccca 


177121 


gtgcctctga 


ccaggcggct 


ataaattggg 


177181 


taatttgeta 


ttaggttggt 


gcaaaagtaa 


177241 


tcattaatag 


aacagatcac 


agaactcagg 


177301 


tattaaagac 


acaacccagg 


aacagctgea 


177361 


aagggctgea 


gagcctccac 


gccctctctg 


177421 


tcaccaaccc 


aaagctctcc 


aaaccttgtc 



tttgcactaa tgtgtcattt ttaactctgt 
ctgagaatat cattagtctg aaagaagcag 
actgtattat aaattaaaag tgttcccaga 
gaccctattg acaatggcaa taacattcta 
gtttttaaac atctgaatca atcttcttat 
gtcgttcatt tgttcagcaa agatttgetg 
ggcatttttt taaaacctca cgatggtgat 
tcaaccaaca gcaattaaag acccctggac 
cacagtggct catgcctgca atcccagcac 
gtcaggagat ggagaccatc ctggctaaca 
aaaaattagc cgggtgtggt ggcgggcgcc 
aggagaatcg cttgaaccca ggaggcagag 
ctccagcctg ggcaacagag tgagactccg 
acccagaaag gecagaaata aaccccatat 
acattccaaa gtcaagaaga aagagtgaat 
tggcaattta gaaaaaaatg aacacttaaa 
tttcacttga attaaaggag ttacatattt 
ataatggagt atttctttga aaggctgata 
gcaactagag aggctgacag agctgaccga 
aaaggaagct ctgeaaatat aaacccaccc 
gggagaacca atcagcttca gaggaggaaa 
aaacaagttt caggccgggc gtggtggctc 
gaggegggtg gatcatgagg tcaggagatc 
cgtctctact aaaaatacaa aaaattagee 
etacteggga ggctgaggca ggagaatggc 
ccgagattgc gccactgcag tccgcagtcc 
aaaaaaaaaa aaaaaaaaaa aggaaacaag 
accaagggtc ccaggccagc ccagccccag 
gggcacctgc aggcaggttg ggggaaggct 
ggacaggctg acaccaagac agagaeggee 
cctggaccag ccatccctgg ccccaaaggc 
ccaatcaggg ccacctgcaa tgtccagcag 
taatgtccct agtcccgaga atgeagaage 
caacctgact gtggaggcag acacctggca 
tccacccagc gctgggcacc ctccaacctc 
ctggcctcat tggccacctt gcccagcaca 
tcatcgggcc accctcccca gtgaggacac 
aaagacagee caaaggecaa gccgccctat 
gactcagcct tgaccttgtt tgctagaaaa 
gaggagacat gaattcccct tttcagcaat 
tgtacctctg aggaggtaca gatttcatct 
tctgggctct gtgggacctg gggtgggtca 
gcacattggg atggctcgtt gttggttccc 
tgeagctgae gacgaagggc ccccagccag 
agactctggg aaagaaaggt gggaacaggg 
acataaaggg agaatggaga gaaaagggaa 
gggagtggga aaaagaggaa tgagaaaaga 
gaagctggag ggagaggagg gggaagggag 
gggaagggag ggagaggagg gggaagggag 
gggaagggag ggagaggagg gggaagggag 
gggaagggag ggagaggagg gggaagggag 
gggaagggag ggaggggctc ccagcaccag 
ctctgcagct gcatttaggg ctgttctgga 
gagccctggt ctgggacctc cccttgttct 
tgggatctgt tttcactcac aatgcttctg 
ccagttctcc aacctccaga cgctagctgg 
caactacctg gggtcagctt agccctcagg 
cttgcggtgg caagttccag ctttccacct 
cgggtcccac aactccctcc tcaggttctg 
ctgtgacctt ggaatcagac agacatgtgt 
gaggcatttt acttactatt tccagttgat 
tgggagggat gggcagggca aggtaaeggg 
ggcactcacc ctcccagaac ctccacctgt 
atttaggggt tttcatggaa gttccatgac 
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177481 
177541 
177601 
177661 
177721 
177781 
177841 
177901 
177961 
178021 
178081 
178141 
178201 
178261 
178321 
178381 
178441 
178501 
178561 
178621 
178681 
178741 
178801 
178861 
178921 
178981 
179041 
179101 
179161 
179221 
179281 
179341 
179401 
179461 
179521 
179581 
179641 
179701 
179761 
179821 
179881 
179941 
180001 
180061 
180121 
180181 
180241 
180301 
180361 
180421 
180481 
180541 
180601 
180661 
180721 
180781 
180841 
180901 
180961 
181021 
181081 
181141 
181201 
181261 



ttatatcagg 

acaaacatac 
acccaacctg 
gtggccagtg 
gggagaggaa 
agaccaggag 
aggatgccta 
cccatggagc 
ccctgggagc 
gtggccttgg 
ccaggtgaga 
atcccaagcc 
attccctttg 
ttgtgaggca 
gccaggcccc 
cgtcatgctg 
acgcaggtct 
gctcctcaaa 
ttctgcaaga 
gacagagtct 
gctccgcctc 
aggcgtgtgc 
tattggccag 
aagttacaca 
tctttgcttg 
aagagcagga 
taagacacag 
gccactggga 
aaagggcttt 
gctggctccc 
gtaaatgggg 
agaacacaaa 
ttaagagtta 
tagaaaacga 
gcgctgtatt 
atgcgtgtag 
gagatgagga 
agggactcca 
agttctataa 
atggcaaaaa 
taaaaatatg 
aaaaatatac 
ttctaagttt 
ttataaaatt 
ataccagcaa 
tagaactcga 
tttcagaaat 
tcaatacttt 
tgaaatccaa 
agtcttcaat 
cttaatactt 
acaagaacag 
atttgcaact 
accaatatgc 
tcagaatggc 
ttccatgtgc 
gtaacgggaa 
cacctgttca 
atacccgggt 
ttcagattag 
gcacagagct 
actgaggatg 



agaggttggg 
aacaagggga 
ggaaattaca 
atttataatt 
cattgcccca 
gccagccatc 
gggagggagg 
cctcccaggg 
ccactgcccc 
tccaggtggc 
cctcaccccc 
ggatagccat 
gacccgagag 
gggccatgcc 
actgaggcca 
ggtggtgtct 
aagtctggtc 
tacaaaagac 
gataaagcag 
tcccatgccc 
caagccacag 
cactctgctg 
ctgggttcaa 
caccatgccc 
gctggtctca 
ggcgtgagcc 
gaaaaacacc 
gactctgcag 
gaggagaatg 
gggaagaaag 
ccaagaacag 
ccaccttctg 
actgggtttc 
caccccggaa 
agacacattt 
gaattcagag 
aagcagagaa 
ccacggctgt 
tccaaaatta 
caaagatata 
aaatcagtag 
gaaaaaaaat 
tatatataat 
tatgttcctg 



gaagtgaggc 
ctcagaccaa 
aaggttttag 
tcccactatc 
gaagcccaga 
gtgcagagag 
agtgcagaga 
cctgggcacc 

cgccaagtgt 
ttatcctgcc 
gcgacggcgg 
cagagggcca 
cactggccat 

taactgggaa 
tgtgagcccc 
tctcctgtgt 
aatctctcag 
atcagtgggc 
atttggtttt 
cttggctgga 
gagattctcc 
agctaatttt 
aacacctgac 
actgtgcccc 
ttctagaagc 
acagtgccac 
gagggattgc 



ittc 



gctttatatg 
atattttatt 

tgtatatttt 
agccttgtgt 
tgactattag 
aaagatggaa 
gactggcgat 
tcacagctct 
cactagcaag 

gggctgcaga 



gtaaagatga 
ggctttcaaa 
ggtgagggct 
caatcggcgg 



gatattgcct 
ataccggggg 

tcagtgttta 
ttagaattaa 
ccttcctgtg 
ccttgaaata 
acttcatggt 

caatcaaatt 
aataaacata 
atccagccag 
tagccatgtt 
taacccagta 

atactcaggg 
ggcttatagc 
agccttacag 
aagtagaatg 
ggtgtactat 
caagcccctt 
ccttagacaa 
tgctttcaaa 
gcctccacgc 
gctctccaaa 
caggctttaa 
ggaaaaatac 
ggtccctcca 
gtctcaggcc 



tgaattctca 
atctgataat 
gagctctgag 
acagggtctc 
cctcagctca 
gggagggcag 
ggaagggagg 
tgcctgggag 
tgatcaggtc 
ggacagccct 
cgcttgcctg 
ctgccttcct 
ggagccgaca 

aatggagggt 
cactccccac 
agcttttgaa 
gatggggcct 
aggcagagag 
tttctgtttt 
gtgcagtggc 
tgcctcagcc 
tgtattttta 
ctcaagtgat 
gcctacagat 
acgtgggata 
cacctcagag 
tccagagaaa 
tgaaagtgga 

cagcaaagtg 
tctggctttt 



itgaaa 
cgttgccgtt 
cgctctgaag 
cagagtgcaa 
tgagttcaat 
taccaataga 
atcctgattt 

taaattatgt 
cacgagggat 
tgtgaccagc 
atgttccaaa 



tactgtattg 
ttctaataat 
tatctgcggt 



aaaaggtgtt 
gcaggaacca 
agacagtacc 
gggccagtgt 
aggggaggga 
ggaaggaagg 
ggcactgagg 
tgcaggatca 
gcagcaaagg 
tgaggacccc 
tctgccgggg 
ccgctgtggc 
cacctcaagc 

cacaggccag 
agccctcgct 
aagtacagac 
atgtacagat 
atcacacaat 
tttttttttt 
gcaatttcag 
tcccaagtag 
atggagatgg 
ctgcccgcct 
ttcttctaag 
acagataaca 
cagcgggcaa 
ctggacagag 
aatgccccca 
cacccgcgtg 
acgagggctt 
gacagctgag 
ggaggtgaga 

aaagccaggc 
gttccagacg 
ctgtatacct 
caagctgctt 
tggaaagtat 
ttaaaaatgg 
agcagaatga 
.ttcctcc 
tctaccc 
caagaaaatt 
gccacacagt 
aaagacacag 

tttggggggt 
ttcagaccag 
gacagcaaag 
aaacacagct 
actgcttaac 
atagacagat 
gtccagaagc 
aaacgtgaag 
ggagggatgg 



ttcatcatat 
gccttgcatg 
cagccgcgtg 
cctctctggg 
ccttgtcgt 
cacttggaaa 
ccaggtttaa 
acacgcatct 

ccggccctgg cccccgcctc 



taggggtttt 
tacttttaat 
i agatgaatcc 



ccagaaactc 
tctatcaccc 
gagatgaaga 
tcagtctgtg 
gctgggatca 
gggaggggat 
gaaggagagg 
agcaccaagg 
gtgctcaacg 
ccagctactg 
tccctactgt 
ctctgaattt 
agcttcccgc 
cccccatgga 
gacacacacc 
caagagaaga 
agtgagcacc 
cccagggctc 
gctgtgcaaa 
ggaaatgctc 
ttttttttta 
ctcactgcaa 
ctgggactac 
ggtttcacca 
cagcctcctc 
gagggaaatg 
gtgggactgg 
agtccgctgt 
ggagggagac 
cacatggccc 
gccgggaggt 
cccatggtgc 
gccgaggctg 
gaagggagcg 
aatgggatct 
tccacagcaa 
acagatgagg 
gtgcagattg 
agaaaaccca 
gataaaagat 
tcaatgagca 
atgacctgta 
aataagctga 
tttaattagt 
gacaaaacaa 
ttgtaagaga 
ccaacagaaa 
gtgaaattaa 
caacctgtaa 
acgaagtctc 

cttcaaagta 
taatattggt 
ctgaggtttc 
gtaaatacat 
tctgtcccaa 
acaccctact 
gcagggcaag 
cccagcacct 
caaaggaaaa 
acattgaaat 
cggacattct 
caacccctga 
ctgggccacg 
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181441 
181501 
181561 
181621 
181681 
181741 
181801 
181861 
181921 
181981 
182041 
182101 
182161 
182221 
182281 
182341 
182401 
182461 
182521 
182581 
182641 
182701 
182761 
182821 
182881 
182941 
183001 
183061 
183121 
183181 
183241 
183301 
183361 
183421 
183481 
183541 
183601 
183661 
183721 
183781 
183841 
183901 
183961 
184021 
184081 
184141 
184201 
184261 
184321 
184381 
184441 
184501 
184561 
184621 
184681 
184741 
184801 
184861 
184921 
184981 
185041 
185101 



tcacgctgtg 
gtgaggcaaa 
cccccggagg 
caggcagcag 
tggcagtggg 
gtgtctcctc 
tagagctaat 
cttgctgcac 
gctattggag 
gagcacagcg 
ctgggttaac 
cagcagcacc 

cccgaggagc 
ccctgtggac 
ggatcctccc 
gattgagggg 
ttctacagag 
actcaaatgc 
tcagacgtgc 
caagacatgc 
cgcacacgtc 
ggctgcccaa 
acccagggtc 
gtgggcaggc 
cggatgtccc 
ggcctgggtg 
ctcactcacc 
ttcacctccc 
actggagaca 



ctga, 



tga 



atggagtgi 

gactatgccg 

ttccaggaga 

ttcctacctg 

tcaaagcccc 

tggccgccgc 

ggccctcagc 

cctgagggga 

accagcagag 

tgaatgccag 

taagtgaagg 

ggagaagggt 

ggttggcacc 

taggggaggg 

tgagtatggg 

cccaggagga 

cgtgttcacc 

agagtttggg 

gcccgcacca 

gtgtcaccca 

gcctggagcc 

gagaggcagg 

agccaacact 

ccaatatgag 

cggaggctga 

cctctgctcc 

tctttagtct 

gaaggaagac 

tcccaggacc 

tgtgtggcct 

gccggcttct 

ggtgcctggc 



cggacgaagc 
aaatggccca 
gcccgcgcct 
tttcttggtg 
gatatgcaga 
acccaccctt 
gaactaaggt 
gaccatgtga 
ccgtgaaatg 
caggccttgg 
ctctgcagcc 
cagctccggg 
agctgatatt 
attccggggt 
gctggggtcc 
tgcaggaaac 
acaccctccc 
acaaacatat 
agaggcaaca 
cacatgcagc 
gccagagagg 
ccaggagggc 
ccagaccata 
tggctgattc 
aggacctggc 
caggtgaggc 
agctgactct 
tgggggacag 
gcacatggcc 
gctctcccag 
gccttcatga 
gacgccagcc 
tgaaatgggg 
aagcagaggt 
ctccagaggg 
tcacgtaagg 
acgctggagt 
caggcgactc 
gccagtgggt 
cgtgacatga 
tgtcggggag 
aggcgtggag 
cgatcctgga 
gtcatccagg 
cttctaagcc 
accctgcaca 
agtctgtccc 
cagccatggg 
gagaggaggg 
gctggtgggc 
gctgagaaat 
cctcaggacc 
caggggtttc 
gactggcctt 
tacctgggga 
tcctgacaac 
cctaccagcg 

cgaaggcagc 
gacttctctc 
ccagacagca 



cactgtcaac 
gttagggtgc 
tggaggaagg 
caaccgaact 

ccagggaacc 
gcgtataaag 
cacggccacc 
tcactgccac 

cccgcaacag 
cacaaggcag 
tcctggacac 
ctggagtcgt 

taatatgaaa 

tcctctcccc 
ccctcccgag 
atgcagacac 
gcacccccac 
cgtacatgca 
caccagcctg 
caggtggggt 
tgccatgcac 
gctcaggcag 
ccttactcac 
cttcacctcc 
ccctgtgagc 
gcaggagggc 
ggaagccacg 

ggagagctgg 
ggggctggga 
cacacagaga 
ccacggtggc 
agatggtgtg 
actgcgctgc 
cagcagggga 
tctgcctcct 
aggccggagc 
ttggatgttt 
gttgctgcat 
agaccatggg 
ggaagccggg 
agggtggggt 
gaagggggtt 
caacatgggg 
attctcctcc 
tccccctcag 
tttgaagatg 
ctttccatct 
cgggggttga 
ggatagcatg 
cccagataat 
tgtcacttaa 
agcagctgtg 
gccccgacac 
ctctcattgc 
gcagctgcaa 
tgcctgatct 
agccttgctg 
tgtggccttt 
gccccgcagg 



accctggaca 
acgcagaaag 
accgctactt 
cagttcccgg 
tgcagacagc 
acaaggcaca 
taatggtgtg 
gctttctcac 
tgaagtgcgt 
gtggtgcttg 
cctccagccc 
ggtggggatg 
tagggttccc 
cgtgcagtgt 
acttctccat 
catacgcagc 
aggggatggg 
tgagtacctc 
aaatacataa 
actcatgtgc 
acgcacactg 
cacactcccg 
ctgtgccggc 
aaagaaggac 
acttgcctct 
ccacgaggca 
ccaggtgtgg 
cagctgactc 
ctcacagcag 
ctgcattcac 
gcgagccatt 
accagtgcca 
ccttggtggg 
gccggctcca 
acctgcatat 
tggctcacag 
gatgctgctc 
actgctggat 
gggagtcatt 
gcagcccttt 
tggagggccc 
ttaagagcag 
ggtttctaaa 
gcagtggggg 
ccctgagaga 
ctggaggaga 
ccagaggcag 
tctggagtcc 
cttcttgtat 
tcccaaaggt 
ggcctccaat 
caaaggttgc 
ggggtggaga 
ctctgagccc 
tggggccatg 
actgagcaag 
taggctgctt 
cctagggaac 
gatccaggca 
atctcaggtg 
gctctagcac 
ggaaaaatga 
atgactctta 
ggagccccag 



ctcaaaatgt 
gcgcatgtag 
cgtaattcac 
gagccgtgac 
atcagagaaa 
gagttgcagt 
tcctgtaacc 
ccccgggcct 
gcggacagaa 
tccgccctgc 
acaataggcc 
gcgtagagcc 
tgtgcctctg 
aattgaaatt 
attaatggca 
agagtgtgca 
ggtgtctaac 



acacatgctg 
acacacgtgt 
catgcacgta 
ccatgcacag 
agcaacacag 
atgtctgcct 
ttgggtcttc 
aagccagcct 
cccctgtgag 
tgcacgaggg 
gcctgggtga 
tcaggcatga 
gggcaggaga 
gaaacatacc 
ctccggctga 

gtaactccta 
gaaaaccgca 
ccccgacccc 
ccctcaggca 
gccccaccct 
ctcccggagt 
tgaccaggtc 
taagaaccac 
aatccctctg 
gtacaagagc 
gctctagaat 
gtggggagag 
ccgagcaggc 
aggagaaggg 
gataggcccc 
gggcaggaga 

ctccttggaa 
gcaattctga 
ctggattcag 
ctgtcccttg 
tcttaaggtg 
gcagctgggg 
ccagaggttc 
gtgccaaggt 
cccctcaccc 
ctgagcctgc 
aactcatctc 
atgtccagca 
cctgaaaggc 



ctttgaagag 
ctttgcattg 
aagaggaaga 
aagcgacggc 
ccaagcactg 
ttaaatggga 
gatggcacct 
ggggtaggcg 
cccatctttt 
cggtcacgag 
cctgccttcc 
tgcctggctt 
agcggaagct 
catggacaag 
gcaccgcaga 
gctcctacct 
gtggcgccaa 
aaggcccctc 
gcccatgctc 



ctcctacatg 
gcccaggaca 
ggacccccag 
gccagcacag 

cgctgctgag 
cctcacagca 
cctgaattca 
ggacctggcc 
ctcgctaagc 
agcctgccac 
cagccacaca 
gaccagtgct 
ccctaaagga 
gagcctcggt 
gggactgggt 
ctcctgaggc 
tctctgtagg 
cccaggggca 
ctcattccag 
acgggggcct 
tgagaggtct 
gctgcagagt 
agaaagcagg 
tggagctggc 
ccacgtgact 
aagcctggag 
tggccacaac 
tcttcattcc 
cctgggctag 
actctcacag 
aacgagttgg 
aatgaacact 
ccaggcctga 
tttgtttcat 
acaagtatgg 
cccacaggag 
ctgcagcccc 
ggggtgggag 
cactgcacag 
tggagcccag 
gatcccttag 
agacagagtg 
agcagccctg 
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185161 
185221 
185281 
185341 
185401 
185461 
185521 
185581 
185641 
185701 
185761 
185821 
185881 
185941 
186001 
186061 
186121 
186181 
186241 
186301 
186361 
186421 
186481 
186541 
186601 
186661 
186721 
186781 
186841 
186901 
186961 
187021 
187081 
187141 
187201 
187261 
187321 
187381 
187441 
187501 
187561 
187621 
187681 
187741 
187801 
187861 
187921 
187981 



18; 



188101 
188161 
188221 
188281 
188341 
188401 
188461 
188521 
188581 
188641 
188701 
188761 
188821 
188881 
188941 



gatttcttct 
cctatctcac 
ctttgggccc 
tggaggcttc 
ggggaggcag 
accagtgagg 
aaggccactc 
acaatgagaa 
tttttaaaat 
ttggaatcag 
ttggagcctc 
tgctgtgagg 
aatgttaata 
ttctcactgg 
tgcccaggag 
agcctgtttt 
cctcgctgtg 
agtgctccac 
aggtgccctg 
ggggatttaa 
gccgggcgcg 
cgcttgaggt 
gaaggaagga 
ctgggcgcag 
acaaggtcag 
attttaaaaa 
tacttgggag 
gctgagatcg 



tcagtgacag 
cttcccagct 

ccagctggcc 
aagtcctggc 
acctgcctca 
tgaatcacca 
tccatcagga 
actcccacca 
taacctccag 
gcgatgtcaa 
aatatggcca 
atattttctg 
ggctctgcag 
gggggctggg 
gaacccatgg 



cagagcaccg 
ccttgtgctg 
agtcacagga 
ccttcaggcg 
aatgcacccc 
acaggaaacc 
tgggcatttc 
ggaagactga 
acggacacaa 
gcacaagaga 



gacccacccc 
ctcctgcctg 
tcagaccagt 
ctaaagcacg 
ccaggtaacc 
gcgtactcac 
ccacgtctcc 



tgccatgcaa 
agggttgtgg 
caggattgga 
atccactgct 
aggcgggcag 
acttaggcca 
agctcccagg 
cgtctccagg 
tctttttgta 
atttgtgttc 
agtttattca 
attaaacaag 
ggaacacagt 
gtttttaatg 
atactggaaa 

gatattatta 
ctgtaaccag 
tgtaaagacc 
cagtaggaca 
gtggctgagg 
tgggagtttg 
aggaaggaaa 
tggctcatgc 
gagattgaga 
aaataaaaaa 
gctgaggcag 
cgccactgca 
aagaaagaaa 
cattcaatgc 
ctgtctggta 
acctctgcac 

cagagtgtcc 
ttgctgacca 
aatgaactgc 
atgcatagta 
ccttacctca 
cgtaaaatgg 
gggctcacca 
agttctaact 
tgtctcr.ata 
actgtaggat 
agagtggcca 
tgggcggctt 
atagaaaact 
ccaatgtcag 
agccacggtc 
ctagagaggc 
ctgcccccaa 

agcatcactg 
ctgccagccc 
actcaacaga 
cactggcaca 
cgttccacaa 
cggatgcaag 
ctgagccttt 



ggaagggcgc 
cttttccaaa 
cagcccatcg 
gacgtagaga 



ccttgggtct 
ggttcgcatc 
ccaggccttg 
gcccagcctg 
ggttgcttcc 
cccataggag 
cctgctcccg 
agtcaaataa 
ggcatgcagg 
aaatactggc 
tctgtagaag 
gcgaggtggc 

ggcatgtaaa 

tgtgcaaagg 
tgaataactt 
ggatgagatg 
caggtcctta 
gacacatacg 
cctgtaatcc 
agaccagcct 
gagaagagaa 
ctgtaatccc 
ccatcctggc 
aaattagctg 
gagaatggcg 
ctccagcctg 
acagaaccaa 
cgctcaggca 
catgggacac 
acgctcctcc 

gtcagccacc 
gtacagggga 
tgaatgaatg 
cacaggtgtt 
ccctaagtcc 
gacacacctc 
ttttaaccca 
aaatccagcc 
gaaagaatga 
gagcctcagc 
ctcactcggg 
cagatgtctg 
gcagtggcct 

cctcctgccc 
gcaggcagtc 
aagatcgagc 

aaccaatgat 
ccagctccct 
ataagtccac 
tgctcaacac 
cgcacatgtt 
cacatgtgca 
ccagagacct 
tggagacccg 
tgctttccag 
ggaaagtcct 
acccaggcga 
cagttgctga 
cagctcccga 



ctctgagcct 
aagcatccag 
gagggcggac 
gcaggagagg 
atcccacgga 
ctgcagccca 
gctccactca 
gaaaggtggg 
aaaaggcaac 
ttcgaagctt 
ggggatgatg 
acagggggct 
ttaccatcat 
aaccacgtga 
agcaacgtga 

ccgtctagtc 

gcaaagaagc 
cagcaacaaa 
cagcacttta 
gatcaaaagg 
gagaaagagg 
agcactttag 
taacacggtg 
cacgtggtgg 
tgaacccgga 
ggtaatagag 
tggtgctgtc 
ggtcccagct 
ttcagcaggc 
ctctgcccgg 
ttcaaacagc 
ctgacgacag 

ggcccaccaa 
gcagcaaaca 
tgaggcccag 
gctcctccag 
aaaggcagtg 
cagtggttgc 
tgcccgagta 
gcaggagttc 
gccagcacag 
ctgctgtgtg 
cagcatcgat 
tctccaggaa 
tcaaaaggga 
agctcatggg 

tcccggtcag 
gggctggtat 
cacacacata 
actgatgtgc 
cacacaggtg 
cacactcaca 
gttcaggcag 
ggcgctcccc 
ccaagccctt 
cagcgcaggc 
gagcagcgga 
gcgccacaga 
gcaggggctc 



cagccacgca 
ggaagggcat 
catctccacc 
atccccgtgc 
agtcccagct 
gcgcaggtca 
cactgacgat 
aaatgttgta 

gaagcaagct 
tcgggatatg 
cagctgagag 
taccttttac 
aattagcacg 
ggcacaacac 
tgtggatact 
actcaacctc 
cagggctctg 
ctatcgtggt 
gtcagaaaag 
ggaggccaag 
aaagaaagaa 
acaagcgaaa 
gaggccgagg 
aaacaccgtc 
cgggcgcctg 
aggcagagcg 
tgagactccg 
ccactgtcca 

ctgtcctgcc 
gttgccaacc 
cccccaggag 
gagccctggg 
agaaggtgct 

cagactgcaa 
tctcctttca 
gatgtcctca 
ggatatttac 
attttccaaa 
agagccctgg 
cgggcgtgct 
cctgagtagc 
ccagcctgac 
ccaggccccg 
ccaggcttcc 
gcagagcttc 

cactagagcc 
aaagaaccag 
ccaaggggca 
tacccgcacg 
tcacgccagt 
agaaacccag 
cacgggcacc 
aaggtacaca 
gcaaaagaga 
atgcataaat 
cagttggagc 

gagaaaggag 
ggaggcttcg 
cacgcgggtg 



gtggggcagc 

gcccagccca 
tgccctgagt 
gcagtccccc 
cgagccagtc 
ggtcaaaccc 
aagaaatgca 
gagcatggcc 
atctcgactt 
ttccgtaggg 
cctggcacac 
gccaaaaggc 
cctaccacgc 
cactctgtgc 
atgaataacc 
ccagagcctg 
tgaggacaag 
ggcagcagag 
gaaagaagag 
atgggcagat 
agaaaaaaag 
agaaaagagg 
caggcagatc 
tctactaaaa 
tagtccctgc 
cttgcagtga 
tctcaaaaaa 
aggccattat 
ccccaatatt 
caggccaatc 
tgcgtctttt 
cacactctcc 
tgggtaccag 



ccccatgtca 
tttaggaaat 
gagacagtgg 
agttgtttgc 
gcaacaagtg 
tgtaattgcc 
caggggccac 
aattagagcg 

ggcagccgga 
tctgaggccc 

tctctcttgt 
ttcatttcca 
ctggagttca 
gagggctaac 
caaggaaaca 
atgcacaagg 
acacatacgt 
cagatgcagg 
cacatacacg 
gaggcgcacg 
ggccccctgc 
acgaggctca 
caagggactc 
ctagagaagt 
gctaggccgg 
gtgcccttct 
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189001 cgccctgcca ctccatagtg gccccgggcg ccacgaggcg ggcgttgggc cgcagccgca 

189061 ggtgcaggtc tcggccaaag accgtgacat tgtagaagag gtgRctgcca ggctcctcct 

189121 cgttgcctcc gggRaagctc ggggtccgga ccggggcggc cctgcgggct cgtacccctg 

189181 ctctggacgt agctgccgac accaYgtggg acaccaagcg gccctgggcg tcagtgcgca 

189241 cgggcaccgc caggatgcgc tccgctccgt gccccagggg cccgcctgca acgggaaggg 

189301 gcgttagatc ggcggagacc acggagcccc agtgcctcag agacccgccg gcaagccacg 

189361 cccccccaga ccccgcccca ctgcgaaggg aaggggcatt ccgccaggcg accccagaag 

189421 ccagcctgca cctccccggc tttcctgcaa 



[0254] Following are human cDNA sequences for transcript variant 1 (long form) and transcript 
variant 2 (short form) of ADAMTS2 (cDNA sequences 1 and 2, respectively). Alternative splicing of the 
ADAMTS2 gene generates two transcript variants, therefore, ADAMTS2 exists in two forms: a "long" 
form comprising a molecule approximately 130 kDa in length, and a "short" form comprising a molecule 
approximately 70 kDa in length. 



ADAMTS2 cDNA Sequence 1 (SEP ID NO. 2) 

NM_0 1 4244 Homo sapiens a disintegrin-like and metalloprotease (reprolysin type) with thrombospondin 
type 1 motif, 2 (ADAMTS2), transcript variant 1, mRNA 

1 atggatccgc cggcgggagc cgctcgccgc ctgctctgcc ccgcgctgct gctgctgctg 

61 ctgctgctgc cgccgccgct cctgccgccg ccgccgccgc ccgcgaacgc caggctcgcc 

121 gccgccgccg accccccagg cgggcccctg gggcacggag cggagcgcat cctggcggtg 

181 cccgtgcgca ctgacgccca gggccgcttg gtgtcccacg tggtgtcggc agctacgtcc 

241 agagcagggg tacgagcccg cagggccgcc ccggtccgga ccccgagctt ccccggaggc 

301 aacgaggagg agcctggcag tcacctcttc tacaatgtca cggtctttgg ccgagacctg 

361 cacctgcggc tgcggcccaa cgcccgcctc gtggcgcccg gggccactat ggagtggcag 

421 ggcgagaagg gcaccacccg cgtggagccc ctgctcggga gctgtctcta cgtcggagac 

481 gtggccggcc tagccgaagc ctcctctgtg gcgctcagca actgcgatgg gctggctggt 

541 ctgatccgga tggaggagga ggagttcttc atcgaaccct tggagaaggg gctggcggcg 

601 caggaggctg agcaaggccg tgtgcatgtg gtgtatcgcc ggccacccac gtcccctcct 

661 ctcggggggc cacaggccct ggacacaggg gcctccctgg acagcctgga cagcctcagc 

721 cgcgccctgg gcgtcctaga ggagcacgcc aacagctcga ggcggagggc acgcaggcat 

781 gctgcagacg atgactacaa catcgaggtc ctgctgggcg tggatgactc tgtggtgcag 

841 ttccacggga aggagcacgt acagaagtac ctgctgacac tcatgaacat tgtcaatgaa 

901 atctaccatg acgagtcctt gggtgcccac atcaacgtgg tcctggtgcg gatcatcctc 

961 ctgagctatg gaaagtccat gagcctcatc gagatcggga acccctctca gagcctggag 

1021 aatgtctgcc gctgggccta cctccagcag aagccagaca cgggccacga tgaataccac 

1081 gatcacgcca tcttcctcac acggcaggac tttgggcctt ccggcatgca aggctatgct 

1141 cctgtcaccg gcatgtgcca tccggtccgc agctgcaccc tgaaccatga ggacggcttc 

1201 tcctcagcgt ttgtggtggc ccatgagact ggccacgtgc tgggcatgga gcacgacggg 

1261 cagggcaacc gctgtggcga cgaggtgcgg ctgggcagca tcatggcgcc cctggtgcag 

1321 gccgccttcc accgcttcca ctggtcccgc tgcagccagc aggagctgag ccgctacctg 

1381 cactcctatg actgcctgct ggatgacccc ttcgcccacg actggccggc gctgccccag 

1441 ctcccgggac tgcactactc catgaacgag caatgccgct ttgacttcgg cctgggctac 

1501 atgatgtgca cggcgttccg gacctttgac ccctgcaagc agctgtggtg cagccatcct 

1561 gacaacccct acttttgcaa gaccaagaag gggcccccct tggacgggac tatgtgtgca 

1621 cctggcaagc attgttttaa aggacactgc atctggctga cacctgacat cctcaaacgg 

1681 gacggcagct ggggcgcttg gagtccgttt ggctcctgct cacgtacctg tggcacgggc 

1741 gtgaagttca ggacccgcca gtgtgacaac ccacacccgg ccaacggggg ccgcacctgc 

1801 tcgggccttg cctacgactt ccagctctgc agccgccagg actgccccga ctccctggct 

1861 gacttccgcg aggagcagtg ccgccagtgg gacctgtact tcgagcacgg cgacgcccag 

1921 caccactggc tgccccacga gcaccgggat gccaaggaga gatgccacct gtactgcgag 

1981 tccagggaga ccggggaggt ggtgtccatg aagcgcatgg tgcatgatgg gacgcgctgc 

2041 tcctacaagg acgccttcag cctctgtgtg cgcggggact gcaggaaggt gggctgtgac 

2101 ggtgtgatcg gctccagcaa gcaggaagac aagtgtggcg tgtgcggagg ggacaacagc 
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2161 cactgcaaag tggtcaaggg cacgttcaca cggtcaccca agaagcatgg 
2221 atgtttgaga tccctgcagg agccagacac ctgctcattc aggaggtaga 
2281 caccatctgg ccgtcaagaa cctggagaca ggcaagttca tcttaaatga 
2341 gtggatgcca gttccaaaac cttcattgcc atgggcgtgg agtgggagta 
2401 gacggccggg agacgctgca gaccatgggc cccctccacg gcaccatcac 
24 61 atcccggtgg gagacacccg ggtctcactg acgtacaaat acatgatcca 
2521 ctgaatgtcg atgacaacaa cgtcctggaa gaggactctg tggtctacga 
2581 aagaagtggt ctccgtgctc caagccctgt ggcggagggt cccagttcac 
2641 tgccgccgga ggctggacca caagatggta caccgtggct tctgtgccgc 
2701 cccaaagcca tccgcagagc gtgcaaccca caggaatgct cccagccagt 
27 61 ggcgaatggg agccatgtag ccagacctgt gggcggacag gcatgcaggt 
2821 cgctgcattc agccgctaca cgacaacacc acccgctccg tgcacgccaa 
2881 gacgcccggc ccgagagccg ccgggcctgc agccgcgagc tctgccctgg 
2941 gccgggccct ggtcccagtg ctcagtaacc tgtggcaacg gcacccagga 
3001 ccctgccgca ccgcggacga cagcttcggc atctgccagg aggagcgtcc 
3061 aggacctgca ggcttggccc ctgtccccga aacatctcag atccctccaa 
3121 gtagttcagt ggctgtcccg cccggacccc gactcgccca tccggaagat 
3181 ggccactgcc aaggcgacaa gtcaatattc tgtaggatgg aagtcttgtc 
3241 tccatcccag gctacaacaa gctgtcctgc aagtcctgta acctgtacaa 
3301 aacgtggagg gcaggataga gccaccgcct gggaagcaca acgacattga 
3361 cctaccctcc cagtgcccac tgtagccatg gaggtgcggc catcaccaag 
3421 gaggtccctc tcaatgcctc cagcaccaat gccacagagg atcacccaga 
3481 gtagatgaac cctacaaaat ccatggcctg gaagatgaag tccagccacc 
3541 cctcgacgac cgagccccta tgaaaagacc agaaaccaaa gaatccaaga 
3601 gagatgcgga agaaagagat gctcggaaag ttctaa 



ADAMTS2 cDNA Sequence 2 f SEP ID NO. 3^ 

NM_021599 Homo sapiens a disintegrin-like and metalloprotease (reprolysin type) with thrombospondin 
type 1 motif, 2 (ADAMTS2), transcript variant 2, mRNA 

1 atggatccgc cggcgggagc cgctcgccgc ctgctctgcc ccgcgctgct gctgctgctg 
61 ctgctgctgc cgccgccgct cctgccgccg ccgccgccgc ccgcgaacgc caggctcgcc 
121 gccgccgccg accccccagg cgggcccctg gggcacggag cggagcgcat cctggcggtg 
181 cccgtgcgca ctgacgccca gggccgcttg gtgtcccacg tggtgtcggc agctacgtcc 
241 agagcagggg tacgagcccg cagggccgcc ccggtccgga ccccgagctt ccccggaggc 
301 aacgaggagg agcctggcag tcacctcttc tacaatgtca cggtctttgg ccgagacctg 
361 cacctgcggc tgcggcccaa cgcccgcctc gtggcgcccg gggccactat ggagtggcag 
421 ggcgagaagg gcaccacccg cgtggagccc ctgctcggga gctgtctcta cgtcggagac 
481 gtggccggcc tagccgaagc ctcctctgtg gcgctcagca actgcgatgg gctggctggt 
541 ctgatccgga tggaggagga ggagttcttc atcgaaccct tggagaaggg gctggcggcg 
601 caggaggctg agcaaggccg tgtgcatgtg gtgtatcgcc ggccacccac gtcccctcct 
661 ctcggggggc cacaggccct ggacacaggg gcctccctgg acagcctgga cagcctcagc 
721 cgcgccctgg gcgtcctaga ggagcacgcc aacagctcga ggcggagggc acgcaggcat 
781 gctgcagacg atgactacaa catcgaggtc ctgctgggcg tggatgactc tgtggtgcag 
841 ttccacggga aggagcacgt acagaagtac ctgctgacac tcatgaacat tgtcaatgaa 
901 atctaccatg acgagtcctt gggtgcccac atcaacgtgg tcctggtgcg gatcatcctc 
961 ctgagctatg gaaagtccat gagcctcatc gagatcggga acccctctca gagcctggag 
1021 aatgtctgcc gctgggccta cctccagcag aagccagaca cgggccacga tgaataccac 
1081 gatcacgcca tcttcctcac acggcaggac tttgggcctt ccggcatgca aggctatgct 
1141 cctgtcaccg gcatgtgcca tccggtccgc agctgcaccc tgaaccatga ggacggcttc 
1201 tcctcagcgt ttgtggtggc ccatgagact ggccacgtgc tgggcatgga gcacgacggg 
1261 cagggcaacc gctgtggcga cgaggtgcgg ctgggcagca tcatggcgcc cctggtgcag 
1321 gccgccttcc accgcttcca ctggtcccgc tgcagccagc aggagctgag ccgctacctg 
1381 cactcctatg actgcctgct ggatgacccc ttcgcccacg actggccggc gctgccccag 
1441 ctcccgggac tgcactactc catgaacgag caatgccgct ttgacttcgg cctgggctac 
1501 atgatgtgca cggcgttccg gacctttgac ccctgcaagc agctgtggtg cagccatcct 
1561 gacaacccct acttttgcaa gaccaagaag gggcccccct tggacgggac tatgtgtgca 
1621 cctggcaagt tcaggccggg cgcggtggct catgcctgtt atcccagcac tttgggaggc 
1681 caaggtaggt ggatcgcctg a 
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ttacatcaag 
cgccaccagc 
agagaatgac 
cagagacgag 
cgttctggtc 
tgaggactca 
gtgggccctg 
caagtatggc 
cctctcgaag 
gtgggtcaca 
gcgctccgtg 
gcactgcaat 
tcgttggcga 
gcggccagtg 
tgagacagcg 
gaagagctac 
ctcgtcaaag 
ccgctattgc 
caacctcacc 
cgtgttcatg 
cacccccctg 
aaccaatgcc 
caacctaatc 
gctcattgat 
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[0255| Following are human polypeptide sequences for isoform 1 (long form) and isoform 2 (short 
form) of ADAMTS2 (amino acid sequences 1 and 2, respectively). 

ADAMTS2 Amino Acid Sequence 1 (SEQ ED NO. 4) 

NP_055059 a disintegrin and metalloprotease with thrombospondin motifs-2 isoform 1; procollagen I N- 
proteinase; Procollagen N-endopeptidase [Homo sapiens] 

MDPPAGAARRLLCPALLLLLLLLPPPLLPPPPPPANARLAAAADPPGGPLGHGAERILAV 

PVRTDAQGRLVSHVVSAATSRAGVRARRAAPVRTPSFPGGNEEEPGSHLFYNVTVFGR 

DLHLRLRPNARLVAPGATMEWQGEKGTTRVEPLLGSCLYVGDVAGLAEASSVALSNC 

DGLAGLIRMEEEEFFIEPLEKGLAAQEAEQGRVHVVYRRPPTSPPLGGPQALDTGASLDS 

LDSLSRALGVLEEHANSSRRRARRHAADDDYNIEVLLGVDDSVVQFHGKEHVQKYLLT 

LMNIVNEIYHDESLGAHINVVLVRIILLSYGKSMSLIEIGNPSQSLENVCRWAYLQQKPD 

TGHDEYHDHAIFLTRQDFGPSGMQGYAPVTGMCHPVRSCTLNHEDGFSSAFVVAHETG 

HVLGMEHDGQGNRCGDEVRLGSIMAPLVQAAFHRFHWSRCSQQELSRYLHSYDCLLD 

DPFAHDWPALPQLPGLHYSMNEQCRFDFGLGYMMCTAFRTFDPCKQLWCSHPDNPYF 

CKTKKGPPLDGTMCAPGKHCFKGHCIWLTPDILKJRDGSWGAWSPFGSCSRTCGTGVKF 

RTRQCDNPHPANGGRTCSGLAYDFQLCSRQDCPDSLADFREEQCRQWDLYFEHGDAQ 

HHWLPHEHRDAKERCHLYCESRETGEVVSMKRMVHDGTRCSYKDAFSLCVRGDCRK 

VGCDGVIGSSKQEDKCGVCGGDNSHCKVVKGTFTRSPKKHGYIKMFEIPAGARHLLIQE 

VDATSHHLAVKNLETGKFILNEENDVDASSKTFIAMGVEWEYRDEDGRETLQTMGPLH 

GTITVLVIPVGDTRVSLTYKYMmEDSLNVDDNNVLEEDSVVYEWALKKWSPCSKPCG 

GGSQFTKYGCRRRLDHKMVHRGFCAALSKPKAIRRACNPQECSQPVWVTGEWEPCSQ 

TCGRTGMQVRSVRCIQPLHDNTTRSVHAXHCNDARPESRRACSRELCPGRWRAGPWS 

QCSVTCGNGTQERPVPCRTADDSFGICQEERPETARTCRLGPCPRNISDPSKKSYVVQW 

LSRPDPDSPIRJCISSKGHCQGDKSIFCRMEVLSRYCSIPGYNKLSCKSCNLYNKLTNVEG 

RIEPPPGKHNDEDVFMPTLPVPTVAMEVRPSPSTPLEVPLNASSTNATEDHPETNAVDEP 

YKfflGLEDEVQPPNLIPRRPSPYEKTRNQRIQELIDEMRKKEMLGKF 



ADAMTS2 Amino Acid Sequence 2 (SEP ID NO. 5) 

NP 067610 a disintegrin and metalloprotease with thrombospondin motifs-2 isoform 2; procollagen I N- 
proteinase; Procollagen N-endopeptidase [Homo sapiens] 

MDPPAGAARRLLCPALLLLLLLLPPPLLPPPPPPANARLAAAADPPGGPLGHGAERILAV 

PVRTDAQGRLVSHVVSAATSRAGVRARRAAPVRTPSFPGGNEEEPGSHLFYNVTVFGR 

DLHLRLRPNARLVAPGATMEWQGEKGTTRVEPLLGSCLYVGDVAGLAEASSVALSNC 

DGLAGLIRMEEEEFFIEPLEKGLAAQEAEQGRVHVVYRRPPTSPPLGGPQALDTGASLDS 

LDSLSRALGVLEEHANSSRRRARRHAADDDYNIEVLLGVDDSVVQFHGKEHVQKYLLT 

LJVWIVNEIYHDESLGAHINVVLVRIILLSYGKSMSLrEIGNPSQSLENVCRWAYLQQKPD 

TGHDEYHDHAIFLTRQDFGPSGMQGYAPVTGMCHPVRSCTLNHEDGFSSAFWAHETG 

HVLGMEHDGQGNRCGDEVRLGSIMAPLVQAAFHRFHWSRCSQQELSRYLHSYDCLLD 
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DPFAHDWPALPQLPGLHYSMNEQCRFDFGLGYMMCTAFRTFDPCKQLWCSHPDNPYF 
CKTKKGPPLDGTMCAPGKFRPGAVAHACYPSTLGGQGRWIA 



[0256] Modifications may be made to the foregoing without departing from the basic aspects of the 
invention. Although the invention has been described in substantial detail with reference to one or more 
specific embodiments, those of skill in the art will recognize that changes may be made to the 
embodiments specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the aspects which follow. All publications or 
patent documents cited in this specification are incorporated herein by reference as if each such 
publication or document was specifically and individually indicated to be incorporated herein by 
reference. 

[0257] Citation of the above publications or documents is not intended as an admission that any of 
the foregoing is pertinent prior art, nor does it constitute any admission as to the contents or date of these 
publications or documents. U.S. patents and other publications referenced herein are hereby incorporated 
by reference. 
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What is claimed is : 

1 . A method for identifying a subject at risk of osteoarthritis, which comprises detecting the 
presence or absence of one or more polymorphic variations associated with osteoarthritis in a nucleic acid 
sample from a subject, wherein the one or more polymorphic variations are detected in a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-3; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-3; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby the presence of the polymorphic variation is indicative of the subject being at risk of 
osteoarthritis. 

2. The method of claim 1, which further comprises obtaining the nucleic acid sample from the 

subject. 

3. The method of claim 1 , wherein the one or more polymorphic variations are detected within 
a region spanning chromosome positions 178746000 to 178751000 in human genomic DNA. 

4. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions selected from the group consisting of 210, 3608, 3609, 431 8, 5593, 5629, 5639, 5640, 
8943, 17968, 19887, 21034, 21085, , 21596, 23379, 23432, 24007, 26121, 26273, 26755, 2741 1, 27710, 
27842, 28379, 29603, 31232, 31504, 32583, 32794, 32840, 33044, 33150, 33218, 33513, 33959, 34486, 
36289, 36570, 38247, 38477, 38518, 38529, 38667, 39781, 39856, 39927, 40506, 41869, 42452, 44788, 
46059, 46846, 47712, 48796, 49441, 49602, 49723, 50050, 50171, 50477, 50818, 50833, 50881, 50882, 
51386, 51534, 52317, 52368, 52970, 53023, 53356, 53882, 54553, 55475, 55530, 55691, 55848, 55879, 
56316, 56911, 57320, 57391, 57437, 57478, 57500, 59111, 59333, 59715, 59804, 59851, 59929, 60052, 
60240, 60359, 60381, 60456, 60724, 60875, 60968, 60978, 60998, 61557, 62091, 62645, 62943, 63131, 
63145, 63406, 63427, 63554, 63661, 64093, 64153, 64409, 64544, 65257, 65626,65739 , 66392, 66720, 
69177, 69336, 69636, 69823, 69928, 70547, 70633, 71805, 72181, 72200, 72474, 72567, 72973, 73468, 
73889, 75730, 75970, 76114, 76342, 76449, 76465, 76791, 78042, 80758, 80778, 81356, 81576, 81689, 
81759, 81950, 82562, 83591, 83700, 83821, 83842, 83923, 83929, 84021, 84175, 84417, 84747, 85746, 
86129, 86335, 87315, 87648, 87764, 87770, 88221, 90474, 91 148, 91 150, 91160, 91733, 91772, 91785, 
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93140, 93148, 96080, 96157, 96313, 96759, 97026, 97320, 97732, 98713, 99707, 99959, 100009, 
100020, 100065, 100086, 101270, 101276, 101371, 101376, 101439, 101820, 102392, 102602, 102604, 
102896, 189104, 189134 and 189205. 

5. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 1 selected from the group consisting of 5640, 33150, 38247, 38529, 
46846, 49723, 50050, 63427, 73889, 189104 andrs428901. 

6. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in linkage disequilibrium with one or more positions in claim 3, 4 or 5. 

7. The method of claim 1, wherein detecting the presence or absence of the one or more 
polymorphic variations comprises: 

hybridizing an oligonucleotide to the nucleic acid sample, wherein the oligonucleotide is 
complementary to a nucleotide sequence in the nucleic acid and hybridizes to a region adjacent to the 
polymorphic variation; 

extending the oligonucleotide in the presence of one or more nucleotides, yielding extension 
products; and 

detecting the presence or absence of a polymorphic variation in the extension products. 

8. The method of claim 1, wherein the subject is a human. 

9. The method of claim 8, wherein the subject is a human female. 

10. The method of claim 8, wherein the subject is a human male. 

1 1. A method for identifying a polymorphic variation associated with osteoarthritis proximal to 
an incident polymorphic variation associated with osteoarthritis, which comprises: 

identifying a polymorphic variation proximal to the incident polymorphic variation associated 
with osteoarthritis, wherein the polymorphic variation is detected in a nucleotide sequence selected from 
the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-3; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-3; 
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(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 

variation; 

determining the presence or absence of an association of the proximal polymorphic variant with 
osteoarthritis. 

12. The method of claim 11, wherein the incident polymorphic variation is at one or more 
positions in claim 3, 4 or 5. 

13. The method of claim 11, wherein the proximal polymorphic variation is within a region 
between about 5 kb 5' of the incident polymorphic variation and about 5 kb 3' of the incident 
polymorphic variation. 

14. The method of claim 11, which further comprises determining whether the proximal 
polymorphic variation is in linkage disequilibrium with the incident polymorphic variation. 

15. The method of claim 11, which further comprises identifying a second polymorphic variation 
proximal to the identified proximal polymorphic variation associated with osteoarthritis and determining 
if the second proximal polymorphic variation is associated with osteoarthritis. 

16. The method of claim 15, wherein the second proximal polymorphic variant is within a region 
between about 5 kb 5' of the incident polymorphic variation and about 5 kb 3' of the proximal 
polymorphic variation associated with osteoarthritis. 

17. An isolated nucleic acid comprising a nucleotide sequence selected from the group consisting 

of: 

(a) a nucleotide sequence in SEQ ID NO: 1-3; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-3; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 
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wherein the nucleotide sequence comprises a polymorphic variation associated with osteoarthritis 
selected from the group consisting of a cytosine at position 5640, a cytosine at position 33150, an 
adenine at position 38247, a thymine at position 38529, an adenine at position 46846, a cytosine at 
position 49723, a cytosine at position 50050, a cytosine a position 63427, a guanine at position 73889, a 
thymine at position 189104, and an adenine at position rs428901 . 

18. An oligonucleotide comprising a nucleotide sequence complementary to a portion of the 
nucleotide sequence of (a), (b), (c), or (d) in claim 17, wherein the 3' end of the oligonucleotide is 
adjacent to a polymorphic variation associated with osteoarthritis. 

19. A microarray comprising an isolated nucleic acid of claim 17 linked to a solid support. 

20. An isolated polypeptide encoded by the isolated nucleic acid sequence of claim 17. 

21. A method for identifying a candidate therapeutic for treating osteoarthritis, which comprises: 

(a) introducing a test molecule to a system which comprises a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of: 

(i) a nucleotide sequence in SEQ ID NO: 1 -3; 

(ii) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide 
sequence in SEQ ID NO: 1-3; 

(iii) a nucleotide sequence which encodes a polypeptide that is 90% or more identical 
to the amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-3; 

(iv) a fragment of a nucleotide sequence of (a), (b), or (c); or 

introducing a test molecule to a system which comprises a protein encoded by a nucleotide 
sequence of (i), (ii), (iii), or (iv); and 

(b) determining the presence or absence of an interaction between the test molecule and the 
nucleic acid or protein, 

whereby the presence of an interaction between the test molecule and the nucleic acid or protein 
identifies the test molecule as a candidate therapeutic for treating osteoarthritis. 

22. The method of claim 21, wherein the system is an animal. 

23. The method of claim 21, wherein the system is a cell. 
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24. The method of claim 21, wherein the nucleotide sequence comprises one or more 
polymorphic variations associated with osteoarthritis. 

25. The method of claim 24, wherein the one or more polymorphic variations associated with 
osteoarthritis are at one or more positions in claim 3, 4 or 5. 

26. A method for treating osteoarthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a nucleic acid, wherein the nucleic acid comprises a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-3; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-3; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ED NO: 1-3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 
whereby contacting the one or more cells of the subject with the nucleic acid treats the 

osteoarthritis in the subject. 

27. The method of claim 26, wherein the nucleic acid is RNA or PNA. 

28. The method of claim 27, wherein the nucleic acid is duplex RNA. 

29. A method for treating osteoarthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a protein, wherein the protein is encoded by a nucleotide sequence 
which comprises a polynucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-3; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-3; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ED NO: 1-3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby contacting the one or more cells of the subject with the protein treats the osteoarthritis 
in the subject. 
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30. The method of claim 29, wherein the treatment comprises administration of an effective 
amount of a composition comprising an active ADAMTS2 polypeptide or fragment thereof, wherein the 
polypeptide fragment is selected from the group consisting of: 252-121 1, 253-121 1, 254-121 1, 255-121 1, 
256-121 1, 257-121 1, 258-121 1, 259-121 1 or 260-121 1 of SEQ ED NO: 4. 

3 1 . The method of claim 30, wherein the polypeptide or fragment has biological activity. 

32. A method for treating osteoarthritis in a subject, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the one or more polymorphic variation are 
detected in a nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ED NO: 1-3; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-3; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ DO NO: 1-3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

administering an osteoarthritis treatment to a subject in need thereof based upon the presence or 
absence of the one or more polymorphic variations in the nucleic acid sample. 

33. The method of claim 30, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

34. The method of claim 30, wherein the treatment is selected from the group consisting of 
administering a corticosteroid, a corticosteroid, a nonsteroidal anti -inflammatory drug (NSAID), a 
cyclooxygenase-2 (COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, 
glucosamine or acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing 
joint surgery; prescribing a weight control regimen; and combinations of the foregoing. 

35. A method for detecting or preventing osteoarthritis in a subject, which comprises: 
detecting the presence or absence of one or more polymorphic variations associated with 

osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 
(a) a nucleotide sequence in SEQ ED NO: 1-3; 
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(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQIDNO: 1-3; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

administering an osteoarthritis prevention or detection procedure to a subject in need thereof 
based upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

36. The method of claim 35, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

37. The method of claim 35, wherein the osteoarthritis prevention is selected from the group 
consisting of administering a corticosteroid, a nonsteroidal anti-inflammatory drug (NSAJD), a 
cyclooxygenase-2 (COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, 
glucosamine or acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing 
joint surgery; prescribing a weight control regimen; and combinations of the foregoing. 

38. A method of targeting information for preventing or treating osteoarthritis to a subject in 
need thereof, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-3; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQIDNO: 1-3; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1 -3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

directing information for preventing or treating osteoarthritis to a subject in need thereof based 
upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

39. The method of claim 38, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 
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40. A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and an antibody that specifically binds to a protein, polypeptide or peptide encoded by a 
nucleotide sequence identical to or 90% or more identical to a nucleotide sequence in SEQ ID NO: 1-3. 

41 . A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and a RNA, DNA, PNA or ribozyme molecule comprising a nucleotide sequence identical 
to or 90% or more identical to a portion of a nucleotide sequence in SEQ ID NO: 1-3. 

42. The composition of claim 41, wherein the RNA molecule is a short inhibitory RNA 
molecule. 
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Abstract of the Disclosure 



Provided herein are methods for identifying a risk of osteoarthritis in a subject, reagents and kits 
for carrying out the methods, methods for identifying candidate therapeutics for treating osteoarthritis, 
and therapeutic and preventative methods applicable to osteoarthritis. These embodiments are based 
upon an analysis of polymorphic variations in nucleotide sequences within the human genome. 
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